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INTRODUCTION 


Modern psychology is a field that involves thousands of active re- 
searchers in laboratories, clinics, hospitals, and universities all over the 
world. The kinds of issues with which psychologists deal range from the 
treatment of schizophrenics to the function of individual brain cells. 
The subject matter is so diverse that it spills over into related fields such 
as biology, sociology, anthropology, philosophy, and law. In addition, 
many of the concerns of psychologists—for example, the effect on the 
viewer of violence on television or the use to which intelligence tests 
are put—are also of immediate concern to large numbers of nonpsychol- 
ogists. Just about anything related to human behavior is being studied 
by some psychologist somewhere. The number of experiments and re- 
Search reports grows continually, producing a torrent of new articles 
each year. 

How can the beginning student gain a toehold in such a diverse field? 
I think that the best introduction is to read selected articles from the 
psychological literature. Thousands of articles are published each year 
in hundreds of periodicals in dozens of languages. Obviously, it is impos- 
sible for anyone to read more than a tiny fraction of these articles. What 
to do? Clearly, one must rely on some sort of sampling procedure, per- 
haps skimming many articles in order to select a more manageable num- 
ber of key articles to read in greater depth. For the beginning student 
this is too often an insurmountable problem, as he or she does not have 
the experience or guidance to differentiate good from bad from indiffer- 
ent research. Yet sooner or later (and I believe sooner is better) the 
serious student must come to grips with the original sources that form 
the mainstreams of modern psychological thought. Textbooks are useful 
for providing an organized approach to a field, but they must necessarily 
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be general. Textbooks cannot be expected to provide the full flavor ae 
research on which they are based. This book is an attempt to provide e 
beginning student with a wedge into the current psychological literature. 
It is designed to be used in conjunction with any general introductory text- 
book but can, of course, be used in other ways as well. | 
The title of this collection of articles, Contrast and Controversy in 
Modern Psychology, reflects the hope that the selected articles represent 
areas of major concern to psychologists and others and are of more than 
simply historical or purely academic interest. Many of these articles 
deal with issues whose relevance spreads far beyond their origins, into 
therapy situations, into discussions concerning violence on television, 
and into the legislatures and courtrooms faced with decisions about 
psychosurgery. Thus one of the major biases in the selection of these 
articles was that they deal with issues that have widespread implications 
for our contemporary society. Informed citizens are necessary if we are 
to enact sensible laws and improve the quality of life, and the selection 
of the following articles reflects this concern, " 
There were other biases in the selection process too. I have tried to 
avoid the trivial but not the controversial. I have selected research that 
directly relates to human behavior, even though a substantial part of 
the psychological literature is based on work with experimental animals. 
I have tried to choose current issues rather than issues that are simply 
of historieal importance. As I did this, it became apparent that many 
of the current issues are, in faet, ^old wine in new bottles." That is, 
many of the concerns with which psychologists tried to come to grips 
in the past are still evident in these articles. This was a bonus, as there 
was not a conscious effort to achieve this end. I have tried to select arti- 
cles that are, on the whole, well written and do not require advanced 
technical knowledge. If scientists Stray too far in their writing and lose 
their nonscientist audience completely, there will be serious negative 
consequences. Of course, no one article will score 100% on all the pre- 
ceding selection criteria. Finally, I have restricted the selections to 
articles published originally in English. 
I sincerely hope this book will leave you frustrated. If so, you will want 
to get more of the available information, to dig deeper into one or more 
of the areas represented in these selections. If 


this book entices you into 
further reading, 


then it will have succeeded in its primary goal. Sugges- 
tions for further reading are to be found in the articles themselves and 
sometimes in the introductory remarks that precede each section. 

I would like to acknowledge the help of the following individuals 


in the preparation of this book: Sheldon Cohen, Ann Elsner, Robert 
Grimm, Edward Lichtenstein, David Moody, Virginia Parr, and Wayne 
Wickelgren. 


1. INTELLIGENCE: 


Is It Just What 
Intelligence Tests 
Measure? 


Perhaps no topic in psychology has been more controversial or explosive 
than intelligence testing. From the time of Francis Galton (1822-1911), 
controversies have raged over such questions as, What is intelligence? 
How do you measure it (assuming you can define it)? Is there a global 
general intelligence factor? Is it more realistic to try to measure individual 
skills and abilities? From the beginning of mental testing, the practical 
applications were apparent, and in fact, the development of many of the 
existing intelligence tests has occurred in the context of pressing prac- 
tical needs. Current interest in intelligence testing is high. Arguments 
rage over almost every aspect of intelligence testing: how to do it, 
whether to do it at all, to what use to put the results. These arguments 
are not simply ivory-tower debates. Entry into colleges and universities 
and into numerous occupations currently depends, in part, on an indi- 
vidual’s scores on some intelligence test. 

The battle over intelligence testing is waged on several fronts, both 
theoretical and practical. On the theoretical side, Quinn МеМетаг in the 
first of the following two articles favors the concept that there is a gen- 
eral intelligence factor (or perhaps two fundamental factors, verbal and 
mathematical), while David McClelland comes out strongly against this 
concept, instead favoring tests that assess various competencies that 
differ according to real-life situations. 

Woven into these articles are terms and concepts that may be unfamil- 
iar to you. Such terms as factor analysis and test validity are used in 
these discussions. These should be good items for discussion with your 
instructor. Do not let jargon keep you from wrestling with these papers. 
While these two articles are remarkably free of the typical jargon of 
social scientists (one of the reasons they were selected), a certain amount 
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of specialized vocabulary is unavoidable. I suspect there is simply no way 
to penetrate the scientific literature in any field without encountering 
a certain amount of new terminology. But we cannot let this terminology 
problem keep us away from this literature, or we will run the risk ofa 
new priesthood of scientists who converse only with one another in their 
specialized secret language. This would be an intolerable situation for the 
great majority of people who are nonscientists (or scientists in only a 
narrow range of subjects). I think it would also be hazardous for the 


pened to some extent, we should resist further erosion of such important 
lines of communication, 


LOST: OUR INTELLIGENCE? WHY? 
Quinn McNemar, Stanford University 


survival properties. Regardless of the word, what is now called intelli- 
gence has been talked about for at least 2,000 years. And as long as 2,000 
years before the advent of attempts to measure intelligence, there seems 
to have been recognition of the fact that individuals differ in intellectual 


The earlier attempts at measuring were based on either of two quite 
distinct conceptions: the Galton-Cattell idea that intellectual ability 
manifests itself in simple, discrimination functioning, and the Binet no- 


tests of so-called general intelligence, 
A few words about definition may be in order. First, it might be claimed 


lightening. Third, that confusion might have been 
from a recent reexamination of the problem of defi 


Address of the President to the Seventy-Second Annual Conv. 
Psychological Association, Los Angeles, September 5, 1964. 

MeNemar, Quinn. Lost: Our intelligence? Why? American Psychologist, 1964, 19, 
871-882. Copyright 1964 by the American Psychological Association, Reprinted by 
permission. 


ention of the American 
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This British chappie found himself struggling with the awful fact that 
the word “definition” itself has 12 definitions. Perhaps the resolution of 
this problem should be assigned to the newly formed Division of Philo- 
sophical Psychology, or maybe the problem should be forgotten since 
psychologists seem to have lost the concept of general intelligence. 


been tested on general intelligence tests were measured for a nonexistent 
function? If it is possible that the notion of general intelligence is not lost 
but merely gone astray, in what corners of what psychological fields 


REASONS FOR DISCARDING THE IDEA OF GENERAL INTELLIGENCE 


Apparently one reason why concepts are either discarded or modified 
beyond recognition is that too much is claimed for them. Among the 


school achievement; the idea that IQ differences are incompatible with 
democracy and lead to educational determinism; and finally, the great 


method that Spearman's g became, seemingly, nonexistent. Thurstone 
said, ^We have not found the general factor of Spearman,” and “we 
cannot report any general common factor in the battery of fifty-six tests 
[p. vii].” As anticipated by some, Spearman was not prone to admit 
defeat. He reworked Thurstone’s data, and a g was found, plus some 
group factors. He charged that Thurstone’s rotational process had sub- 


American factorists found Thurstone convincing. The description of 
abilities in terms of seven primaries was an attractive package. The зо- 
called primaries were more amenable to specific definition than the old 
hodgepodge called general intelligence. Despite the fact that Thurstone 
was able to replicate his findings on samples from two other populations, 
thus giving credence to his method and results, there were a couple of 
events that led to some turbulence in his seven-dimension rarified atmos- 
phere. The first of these was a minor study by one of his own students, 
based on the intercorrelations Stanford-Binet items of 1916, in which the 
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g refused to be rotated out. But rather than admit that this might be 
some kind of general intelligence, the author renamed it maturational 
level.” Incidentally, this illustrates the first cardinal Principle of Psy- 
chological Progress: Give new names to old things. 

The second disturber of the neat little set of primaries, sans a g, re- 
sulted when Thurstone took the next logical step, that of constructing 
tests to measure the primaries. It was found that the primaries were 
themselves intercorrelated, whereas it had at the time been expected 
and hoped that they would be independent. The Thurstones (1941, p. 26) 
readily admitted that a general factor was needed to explain the inter- 
relatedness of the primaries. This eventually led to the idea of oblique 
axes, which axes were regarded as representing the primaries as first- 
order factors, whereas the general factor pervading the primaries was 
dubbed a second-order factor. It began to look as though Spearman was 
being revisited, except for the little matter of labeling: anything called 
second-order could not possibly be regarded as of much importance. 
Furthermore, it could always be said that in the ability domain it is less 
difficult to attribute psychological meaningfulness to first-order than to 
second-order factors, so why pay much attention to the latter? Thus it 
was easy for most American factorists to drop the concept of general 
intelligence and to advocate that tests thereof, despite their proven 
usefulness over the years, should be replaced by tests of the primaries. 
Hence the emergence of differential aptitude batteries, about which more 
later. 

Meanwhile, our British cousins did not tag along with the factor meth- 
ods preferred on this side of the Atlantic. After all, it is possible to use 
factor methods that permit a sizable general factor, if such exists, to 
emerge as the very first factor. Being first, it is, presto, the most impor- 
tant, as indeed it is as a factor explaining, for the starting battery as a 
whole, more variance for more tests than attributable to any American- 
style primary factor. The methods preferred by the British also yield 
group factors, apt to bear the same name as the primaries, but of attenu- 
ated importance. Apparently, the British are Skeptical of the multitude 
of ability factors being “discovered” in America. The structure of intellect 
that requires 120 factors may very well lead the British, and some of the 
rest of us, to regard our fractionization and fragmentation of ability into 
more and more factors of less and less importance as indicative of scatter- 
brainedness. This statement presumes that intellectual abilities are 
brain centered. 

In practically all areas of psychological research, the demonstration 
of trivially small minutiae is doomed to failure because of random errors. 
Not so if your technique is factor analysis, despite its being based on the 
correlation coefficient —that slipperiest of all statistical measures. By 
some magie, hypotheses are tested without significance tests. This happy 


situation permits me to announce a Principle of Psychological Regress: 
Use Statistical techniques that lack inferential power. This will not 
inhibit your power of subjectiv 


e inference and consequently will progress 
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you right back to the good old days when there was no strangling stat or 
sticky stix to make your insignificant data insignificant. 

It may be a long time before we have an ivory-tower, strictly scientific 
resolution of the issue as to whether a scheme involving primary abilities 
plus a deemphasized g is preferable to one involving an emphasized g 
plus group factors. With bigger and better computers we will have big- 
ger, though not necessarily better, factor-analytie studies, but it seems 
unlikely that such further studies will, in and of themselves, settle the is- 
sue under discussion. Until such time as some genius resolves the broader 
question, so ably discussed by Lee Cronbach in 1957, of the place, if 
any, of correlational method in a science that aspires to be experimental, 
we may have to turn to the criterion of social usefulness as a basis for 
judging whether it is wise to discard general intelligence. Like it or not, 
much of our heritage in this area is that earlier workers, from Binet on, 
had as their motivation the solution of social problems, and currently 
many in the area have a similar motivation. 


THE BEARING OF SOCIAL USEFULNESS 


In practice, if you believe that the concept of general intelligence has 
outlived its usefulness, you may choose from among several differential, 
or multiple, aptitude batteries which will provide measures of some of the 
so-called primary mental abilities. If you happen to believe that there is 
something to general ability, you can find tests to use. The novice looking 
for these latter tests may have to alert himself to the first Principle of 
Psychological Progress: The test labels may have changed from “general 
intelligence” to "general classification" or “scholastic aptitude." If you 
enjoy riding the fence, you might become a devotee of the practice of 
the College Board and others and measure just two abilities: Verbal 
and Quantitative. 

This is certainly not the place to review the voluminous literature that 
amply demonstrates the practical utility of tests of general intelligence. 
Nor is it the place to catalog the misuses of the Stanford-Binet for pur- 
poses which Terman never claimed for it, or the misuses of the Wechsler 
scales for purposes which Wechsler has claimed for his scales. Neither 
the Binet nor the Wechsler provides a factorially pure, unidimensional 
measure of a g. The current Stanford-Binet was in reality constructed 
too early to benefit from the implication of factor analysis for test purity, 
whereas the Wechsler scales were based on the impossible premise that 
10 or 11 subtests can simultaneously provide diagnostic subscores and a 
meaningful total score. Of the many group tests that apeared between 
1920 and 1945 it can be said that few, if any, provide unidimensional 
measures of general intelligence. The chief difficulty is that most of them 
lead to a total score based on a mixture of verbal and mathematical ma- 
terial. Thus with two main sources of variance, marked qualitative dif- 
ferences can exist for quantitatively similar total scores. The College 
Board Educational Testing Service people have justifiably refrained from 
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giving a total score involving verbal plus math, but there are those who 
question the usefulness of the Board’s math score, and there are those 
who criticize the Educational Testing Service for failing to change over 
to a differential aptitude battery. 

Let us next turn to a somewhat more detailed examination of the vari- 
ous so-called multiple aptitude batteries. What and who influenced whom 
in the development of these batteries is difficult to disentangle. At the 
risk of oversimplification, it might be said that two prime influences 
operated. 

First, the early factor studies by the Thurstones, by Holzinger, and 
by Guilford are the progenitors of the Science Research Associates’ 
Primary Mental Abilities (PMA) Test, the Holzinger-Crowder Unifactor 
Tests, the Guilford-Zimmerman Aptitude Survey, and the Segel-Raskin 
Multiple Aptitude Tests (MAT). 

The second influence, which seems to have emerged from testing 
experience in the Armed Services during World War II, is the job- 
element approach, an approach which may or may not differ from the old 
job-analysis method. For whatever jobs you are dealing with, you study 
the activities involved in order to decide what aptitudes are called for. 
Whether or not these aptitudes have been previously isolated by factor 
analysis is totally irrelevant. It is hoped that some jobs will have apti- 
tudes in common so that the needed number of tests will be less than the 
number of jobs. The one battery that is built on this approach is the 
Flanagan Aptitude Classification Tests, a battery that just happens to 
have the catchy abbreviation FACT. If we cannot muster any facts in 
psychology, we can at least have FACT scores! 

A cross between testing for factorially defined abilities and job-element 
derived aptitudes is apparently involved in the General Aptitude Test 
Battery (GATB) of the United States Employment Service and the 
Differential Aptitudes Tests (DAT) of the Psychological Corporation, 
Since in both batteries some of the tests seem to have sprung from factor- 


analysis results and some tests seem to have been thrown in as possible 
predictors of specific performances. 
It is not our pur 


test batteries, but 


the other three batterie 
Some will have noted th 
ignored here. 
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Now to get back to our main theme, to what extent have the 7 batteries 
contributed to the demise of general intelligence? In attempting to an- 
swer this, one encounters a paradox: Some test authors want to eat their 
cake and have it too. They attempt to measure factors and g with the 
same instrument. This is understandable in a couple of instances. Three 
of the 15 tests of the Employment Service’s GATB were included to 
provide a measure of general intelligence, apparently because the au- 
thors still saw some merit in a g and were not committed to the factor 
schemata. Holzinger and Crowder suggest a weighted score for a mea- 
sure of g, perhaps because of Holzinger's long-time alignment with Spear- 
man. The real teaser is why Thurstone ever sanctioned, if he did, the 
summing of the Science Research Associates’ PMA scores to obtain an 
IQ. One has the uncomfortable feeling that his publishers wished to g 
garnish the factor cake to make it more palatable in the marketplace. 

Although Segel says nothing in his 1957 article about a general score 
from the Segel-Raskin MAT, the test publishers say that, in addition to 
yielding scores for four factors, it also provides a “Scholastic Potential” 
Score. Perhaps Flanagan has not completely broken with tradition, since 
he states that four tests of the FACT battery measure “General College 
Aptitude"—a statement made with the same lack of empirical validity 
as the claim, which should be anxiety producing for those of you who fly 
a certain airline, that your highly paid pilot shares four of the aptitudes 
of a plumber! 

Apparently, Guilford and Zimmerman and the test people at the Psy- 
chological Corporation are willing to stick to the sound principle that a 
differential test battery cannot provide factor scores that can be summed 
to obtain a meaningful IQ, or measure of a g. 

Parenthetieally, it might be said that the California Test of Mental 
Maturity (CTMM), which, according to the publisher’s 1963 catalog, was 
originally “designed as a group test of intelligence patterned after the 
individual Stanford-Binet,” serves as an illustration of factor icing a g 
cake. Some multitest batteries and the CTMM have a Madison Avenue 
advantage: The advertising claims the measurement of not only factors 
but also g; not only g but also factors. This measurement absurdity is 
all too apt to go unrecognized by many test users and hence a sales ad- 
vantage for the aptitude battery that produces both factor scores and 
an IQ. 

Just how successful have the multitest batteries been? Since by far 
the most extensive social use of tests has been, and continues to be, 
in the schools, let us look at the evidence of validity studies therein. 
As indicated previously, little is known about the predictive usefulness 
of some of the seven batteries discussed above. The DAT of the Psy- 
chological Corporation is the only battery for which adequate predictive 
(and concurrent) validity data, derived from school sources, are available. 
It is also the battery that has fared best in the hands of the test reviewers; 
therefore, if we allow the case for differential batteries to rest thereon, 
we will be looking at the best. So what is the story? 
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Recall that the hoped-for advantage of a multitest battery over the 
old-fashioned general intelligence test was that it would have greater 
predictive power, a power which could manifest itself in higher validity 
coefficients for specific subject matter and, perhaps, for overall achieve- 
ment. It was hoped that such a battery would be truly differential in that 
particular factors (or subtests) would correlate higher with achievement 
in some areas than in other areas. Presumably each factor (or subtest) 
should have unique usefulness. If a battery were truly differential, it 
would be a boon to school guidance personnel. 

Now the manual of the DAT of the Psychological Corporation contains 
a staggering total of 4,096—yes, I counted 'em— validity coefficients. 
With such a large pool to draw from, one could by gracious selection 
"show" that the DAT is the answer to the prayer of every counselor, 
male or female, or by malicious selection one could "prove" that the DAT 
is far worse than any test ever published. The validity coefficients range 
all the way down to —.37, which is presumably a chance deviation down- 
ward from 0, and all the way up to -90, which is likely not a chance devia- 
tion downward from unity. But ranges tell us nothing. After a careful 
perusal of the 4,096 correlations, it seems safe to summarize DAT validi- 
ties as follows: 

1. Verbal Reasoning (analogies to most of you) is the best single pre- 
dictor; Language Usage, as represented by a sentence test dealing with 


grammar and word usage, and admittedly more achievement than apti- 
tude, is a close second. 


2. Numerical Ability, 
tions, designed to tap 
component, is the best 
It does not, however, 
Verbal Reasoning. 

3. Aside from the Numerical 
Shows differential p 
spell, you may hav 

4. The remainin 


as measured by a test of simple arithmetic opera- 
arithmetie reasoning without the usual verbal 
predictor of achievement in school mathematics. 
correlate as well with grades in science as does 


Ability test, the only other test that 
ower as a predictor is the Spelling test. If you cannot 


ds to provide some support 


for the Educational Testing Se Board practice of providing 


| rvice-College 
Scores for just these two abili 


ties. 
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And now we come to a very disturbing aspect of the situation. Those 
who have constructed and marketed multiple aptitude batteries and 
advocated that they be used instead of tests of general intelligence seem 
never to have bothered to demonstrate whether or not multitest bat- 
teries provide better predictions than the old-fashioned scale of general 
intelligence. Be it noted that we are not discussing experimental editions 
of tests. Some say that insofar as a test publisher provides validity data 
for a new battery it is not necessary to show that the validities are, for 
the given school condition, better than those of other tests. With this, 
one can agree, but only in case no claims are made, explicitly or implic- 
itly, regarding superior merits for the new battery. 

It is far from clear that tests of general intelligence have been out- 
moded by the multitest batteries as the more useful predictors of school 
achievement. Indeed, one can use the vast accumulation of data on the 
validity of the Psychological Corporations DAT to show that better 
predictions are possible via old-fashioned general intelligence tests. 
Consider the fact that a combination of the tests Verbal Reasoning (anal- 
ogies) and Numerical Ability would be, in terms of content, very similar 
to many group tests of general intelligence. Consider also that an equally 
weighted combination of these two tests correlates in the mid-.80s with 
the Otis S-A, Higher Form. Then when you turn to a careful study of the 
empirical validities as reported in the DAT manual, you will not be sur- 
prised at the outcome of the application of a little arithmetie, which leads 
to the definite conclusion that a simple unweighted combination of the 
Verbal Reasoning and Numerical Ability tests predicts as well as or, in 
most instances, better than any subtest taken singly or in the differen- 
tial sense. 

The manual for the DAT contains the following statement (Bennett, 
Seashore, & Wesman, 1952): 


Apparently the Verbal Reasoning and Numerical Ability tests can serve most 
purposes for which a general mental ability test is usually given in addition to 
providing differential clues useful to the counselor. Hence, the use of the so-called 
intelligence test is apparently unnecessary where the Differential Aptitude Tests 
are already being used [p. 71]. 


Anyone who disagrees with this quotation could with better justification 
say that an intelligence test can serve nearly all, if not all, the purposes 
for which a multiple aptitude battery is given in the schools because the 
former, in general, is a better predictor and because, as we saw earlier, 
the differential clues are too fragmentary to be of use to the counselor. 
And there is a bonus: one classroom period of testing compared to six 
periods. A second bonus: much less costly. A third bonus: fewer scores 
to confuse the already confused minds of most school counselors. 

Thus we come to the conclusion that general intelligence has not been 
lost in the trend to test more and more abilities; it was merely misplaced 
by a misplaced emphasis on a hope that a lot of us, including the speaker, 
once entertained, a hope that in turn was based on a misplaced faith in 
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factor analysis: the hope that factors, when and if measured, would find 
great usefulness in the affairs of society. By the criterion of social useful- 
ness, the multiple aptitude batteries have been found wanting. Now I 
have no desire to furnish ammunition for those test critics who would 
have us stop all testing merely because they find a trivially faulty item 
in a standardized test. At a time when there is shouting about the tyr- 
anny of the testers and the brass of the brain watchers, at a time when 
school people are showing resentment at the disruption caused by too 
many national testing programs, at a time when federal and state legis- 
lators are all too willing to write legislation that places restrictions on the 
use of tests, and at a time when both majorities and minorities are be- 
ing denied the benefits of test-based guidance because certain well- 
intentioned persons fail to realize that scores for the underprivileged 
minorities are useful indices of immediate, or present, functioning—at 
a time when all these and other forces are operating to throw out the 
tests, it is high time for the profession to establish a bureau of standards 
to test the tests instead of coasting down a road that is tinged with some 
of the trappings of Madison Avenue. Better to have informed internal 
control than ignorant, hostile, external control. 


INTELLIGENCE ELSEWHERE? 


sures of factor-analytie defined abilities. On 
Most of you will have 
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is known about the creative process that measuring instruments are 
seemingly chosen on a trial-and-error basis. 

We might presume that the role, if any, of general intelligence in crea- 
tivity would increase as we pass from art to music, to architecture, to 
literature and drama, to science. Your presumption about the ordering 
may be different and more nearly correct. I would like to discuss briefly 
the extremes of my ordering. 

At the risk of being called a heretic and a has-been statistician, I would 
like first to resort to the single case of a painter, examples of whose 
works were reproduced in color recently by Desmond Morris (1962) in 
a journal called Portfolio and Art News Annual. To my uncultured eye 
these paintings have the general appearance of the so-called school of 
modern art, and the running comment on the paintings involves what 
I must presume is the jargon of contemporary art critics: talk about self- 
rewarding activities, compositional control, calligraphic differentiation, 
thematic variation, optimum heterogeneity, and universal imagery. Since 
authors may use pen names, I would guess that this painter is using 
“Congon” as a “brush” name. Supposedly, by now some of you will be 
guessing that this single case is of interest in the context of this paper 
because of Congon’s IQ. Well, because of this painter’s underprivileged 
cultural background, no test scores are available. Congon, despite strik- 
ing contribution to art, happens to be a chimp. Aside from a rather ob- 
vious conclusion, one wonders what would emerge from a blind (as to 
source) analysis of Congon’s paintings by the personality boys. We might 
even tell them that Congon was breast fed. 

Without in any way implying that creativity in the arts is unimportant, 
we hasten on to scientific creativity, a specific area in which it seems 
likely, because of the Sputnik-inspired spurt of interest, that we can 
learn something of the role, if any, of general intelligence. But immediately 
we encounter skulls that have been cracked on the criterion problem. 

One elaborate study (C.W. Taylor, Smith, Ghiselin, and Ellison, 1961) 
on a sample of 166 physical scientists working at Air Force research 
centers came up with 150—yes, believe it or not—150 criteria of scien- 
tifie productivity and creativity. By combining some scores and elimi- 
nating others, the number of criteria was reduced to 48. A factor analysis 
of the intercorrelations of the 48 reduced the number to 14 “categories.” 
Apparently the 150 original criterion measures included everything 
except success at turning on a kitchen faucet, so one need not be sur- 
prised at the outcome of the factor analysis. For example, one factor- 
derived criterion of scientific productivity and creativity is "likableness"; 
another is “status seeking”; another is extent of membership in scientific 
and professional societies—the joiners, no doubt. 

The fact that the intercorrelations among the 14 criterion categories 
derived from factor analysis range from —.08 to +.55, with a median of 
only .18, indicates either criterion complexity or else a whale of a lot of 
vagueness as to what is meant by productivity and creativity in science. 
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Now this criterion mess emerged from a study of interview results of 166 
physical scientists, but nearly half of these so-called scientists were engi- 
neers, and the education of the total group indicates only two years of 
graduate work on the average; so when is a scientist a scientist a scientist? 

The next step in this study was to collect data on 107 of these so-called 
scientists, for a whopping total of 130 potential predictors, which, when 
pitted against 17 criterion measures, produced 2,210 validity coefficients. 
The distribution of these, excluding 30 values involving un-cross-validated 
empirical keys, almost restores one’s faith in the random-sampling dis- 
tribution of correlation coefficients around zero! There were 16 predictors 
based on aptitude tests, hence 16 x 17, or 272, validities for this area. 
Since only 4% of them reach the 5% level, we can do no more than accept 
the null hypothesis: Aptitude ain’t important in scientific productivity 
and creativity. The idea that some scientists are more equal in ability 
than others apparently is not true. 

But this criterion- 
the role of general i 


ormance was concerned with the creativity of architects. Although 
the author reports that within a creati d Pai iic 
tially zero b i 
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artistic designs, then the intelligence component would likely be of less 
importance. It would seem that when the author says we “may have over- 
estimated . . . the role of intelligence in creative achievement” [р. 493], 
he should have included some marked qualifications as to what type of 
creativity he had in mind. 

That such qualification is indeed necessary is supplied by a funding of 
still another investigator (Barron, 1963). For a group of highly creative 
writers it was estimated, by way of the Terman CMT, that their average 
IQ is about 140, which we interpret as meaning that a high IQ is a neces- 
sary though not sufficient condition for outstanding success as a writer. 
On the basis of his own studies and those of other persons, this same 
investigator suggests that “over the total range of intelligence and crea- 
tivity a low positive correlation" of .40 probably obtains. This sweeping 
generalization is for all areas of creativity. 

And speaking of sweeping generalizations, consider the suggestion in 
a 1961 study (Holland, 1961) that *we need to use nonintellectual criteria 
in the selection of students for scholarships and fellowships [p. 146].” The 
author did not say so, but presumably he meant in addition to intellectual 
ability; maybe he did not, since he had previously concluded that “intel- 
ligence has little or no relationship to creative performance in arts and 
science [p. 143]" at the high school level. His data back up this conclusion, 
as might have been expected when correlations are based on groups 
restricted in range to the top 1%! 

If the foregoing examples of criterion-based studies of creativity seem 
to indicate that general intelligence is relatively unimportant for crea- 
tivity, it should be remembered that drastic but unknown or unspecified 
curtailment of range exists for both ability and criteria. Why do correla- 
tional studies under such adverse circumstances? 

Next, we turn to a few studies of creativity which cannot be criticized 
because of restriction of range on the criteria. These studies simply avoid 
this problem by never having actual criterion information. The approach 
is to claim that certain tests, which typically are scored for novel re- 
Sponses or novel solutions to problems, ave measures of creativity, with 
no evidence whatsoever that the tests have predictive validity for non- 
test, real-life creative performance. This bit of ignorance does not prove 
to be a handicap to those who think that creativity can be studied without 
the nuisance of obtaining criterion measures. We reluctantly accept the 
test-based criteria solely for the sake of seeing what happens to general 
intelligence as a part of the picture. Time permits only three examples. 

We first note that general intelligence has not manifested itself as a 
correlate of so-called creativity tests in the factor-analytic studies of 
creativity. The explanation for this is easily found—no measures of 
general intelligence are used in these studies. When discussing his plans 
for studying creativity, a certain author (Guilford, 1950) said that “we 
must look well beyond the boundaries of the IQ if we are to fathom the 
domain of creativity [p. 448]." He went on to say the conception “that 
creative talent is to be accounted for in terms of high intelligence or 
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19...15 not only inadequate but has been largely responsible for lack 
of progress in the understanding of creative people [p. 454].” With a part 
of this, one can agree, but does it follow that one should prejudge the 
role of general intelligence as a source of variance in creativity tests or 
factors derived therefrom? Does the failure to include an IQ test help 
one learn the extent to which one must go beyond the boundaries of the 
19 to fathom creativity? Apparently the author, although willing to 
predict that the correlations between IQ and the many types of creativity 
tests “are only moderate or low,” was unwilling to include an IQ test 
for the sake of finding out. However, negation by omission is not very 
convincing. 

That at least one test bearing the label “ 
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The authors say that it “is quite surprising” that the high creativity 
group achieved so well. From this it is concluded that the “creative in- 
struments account for a significant portion of the variance in school 
achievement [p. 24],” and the subsequent argument implies that crea- 
tivity is more important for ordinary school achievement than is the 19. 
Now anyone who is at all familiar with a three-variate problem will not 
be “unexpectedly” surprised at the foregoing results. Indeed, if the au- 
thors had bothered to give the three basic correlations among the three 
variables (IQ, creativity, and total school achievement) for the entire 
group, any person versed in simple multivariate analysis could deduce 
the results. Furthermore, he could deduce a further result (and this 
one has been overlooked by the critics) which might be unpleasantly 
surprising to the thesis of these authors: namely, the high IQ and the 
high creative groups did equally well in school achievement despite an 
unreported difference in mean creativity that is of the same order as the 
much stressed difference in IQ.1 Utilizing the half-blind logic of the au- 
thors, one can say that creative ability is not as important as IQ for school 
achievement—just the opposite of their proposition. 

Now the fact that seven of nine replications of this study confirm the 
original findings merely indicates that repetition of the same faulty de- 
sign and false logic will lead to the same false conclusions. The design 
being used is such that if two variables are equally correlated with a 
third, the conclusion will be reached that the two are actually unequally 
correlated with the third. This is the neatest trick of the decade for sup- 
plying educationists with an antidote for the IQ virus. I cannot refrain 
from saying at this point that, although discouraged, I am still hopeful 
that people who do statistical studies will first learn a modicum of ele- 
mentary statistics! 

Time does not permit a discussion of other studies in which creativity 
is defined in terms of test performance instead of being based on actual 
creativity of the sort prized by society. In summary of this brief on crea- 
tivity studies, I would like to offer a few dogmatic-sounding observations. 
First, one need not be surprised at the fact that so-called creativity tests 
do not yield high correlations with IQ tests—but the correlations are 
generally far higher than those found in typical studies with range re- 
Strictions. I would anticipate that for normalized scores, the uncurtailed 
Scatters for IQ versus creativity tests will be bivariate normal. Second, 
if we have honest to goodness criterion measures of literary or archi- 
tectural or scientific creativity, the scatter diagram between IQ and such 
creativity (not normalized, since it makes sense to expect askewed dis- 
tribution for actual creativity) will be triangular in shape for unselected 
cases. That is, at the high IQ levels there will be a very wide range of 
creativity, whereas as we go down to average IQ and on down to lower 
levels, the scatter for creativity will be less and less. Having a high IQ 
is not a guarantee of being creative; having a low IQ means creativity 
is impossible. Third, it remains to be seen whether or not the so-called 
creativity tests and/or factors derived therefrom have appreciable value 
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as predictors of actual creative performance. Such tests may or may nu 
yield better predictions than a test of general intelligence. Fourth, as far 
as I am concerned, to claim factorial validity for creativity tests, along 
with definitions of creativity in terms of tests, is an unwarranted avoid- 
ance of the fundamental problem of validity. 

The recently renewed interest in gifted children, along with the flurry 
of creativity studies, has led to a reexamination of methods for identi- 
fying the gifted. It has long been recognized that identification in terms 
of high IQ is too narrow—those gifted in such areas as art and music 
would be overlooked. The argument against the IQ is now (Torrance, 
1962) being reinforced by the claim that the selection of the top 20% on 
IQ would mean the exclusion of 70% of the top 20% on tested creativity. 
This startling statistic, which implies a correlation of only .24 between 
IQ and creativity, is being used to advocate the use of creativity tests 
for identifying the gifted. Be it noted that these creativity tests will also 
miss those gifted in art and music. 


We are being told that it is important 


"to identify ereative talent 
early in life." 


Hence you need not be surprised that the search goes down 
to the kindergarten level, with claims of successful identifieation. The 
creativity tests are presumed to be better for this purpose than the IQ 
tests because of the failure of the IQ to be constant, an argument that 
completely overlooks the fact that the IQ does have some constancy 
whereas absolutely nothing is known about the stability o 
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tests, known to be of unknown validity as predictors. Anyway, progress, 
defined as change, is in the offing. 
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difficulty. Again, I express the hope that the IQ is replaced by something 
better rather than by something worse. 

There are other areas, such as reasoning, problem solving, and concept 
formation, in which one might expect to find some consideration of intelli- 
gence as an aspect. One might also expect that investigators of thinking 
would have something to say about individual differences in thinking 
being dependent upon intelligence, but for some unintelligent reason, 
these people seem never to mention intelligence. Surely it cannot be 
inferred that thinking about thinking does not involve intelligence! 


IN CONCLUSION 


It has been the thesis of this paper that the concept of general intelli- 
gence, despite being maligned by a few, regarded as a second-order 
funetion by some, and discarded or ignored by others, still has a rightful 
place in the science of psychology and in the practical affairs of man. It 
has not been argued that the nature of general intelligence is well under- 
stood. Much, however, has been written about its nature. Over 40 years 
ago (Intelligence, 1921a, 19215) an editor secured and published the 
reasoned views of 18 well-known test psychologists. Later Spearman 
set forth his speculations about the nature of g. Prior to these, Binet 
had, of course, given much thought to the problem. 

More recent discussions exist. Hebb (1949) has considered the problem 
from the viewpoint of neurology and brain functioning. Cyril Burt (1955), 
always a vociferous defender of the concept of general intelligence, has 
reviewed the evidence for a g and restated the idea, dreadful to some, 
that intelligence is innate. Perhaps it was inevitable that Raymond 
Cattell (1963), who has camped with the general intelligence contingent, 
should gaze into his crystal n-dimensional factor ball and find evidence 
for crystallized as opposed to fluid general intelligence. Joseph McVicker 
Hunt’s (1961) book on Intelligence and Experience is in large part de- 
voted to questions pertaining to the nature of intelligence. 

By far the most provocative recent discussion that I have encountered 
is the closely reasoned 44-page paper by Keith Hayes (1962). He puts forth 
a motivational-experiential theory of intelligence. In essence, he pre- 
sumes that there are hereditary differences in motivation. “Experience- 
producing drives” and environmental differences produce differences in 
experience, which in turn, by way of learning, lead to differences in 
ability. Therefore, differences called intellectual are nothing more than 
acquired abilities. I think that Hayes has ignored the possibility of in- 
dividual differences in learning ability, but if such a formulation leads 
to experimental manipulation of variables, we may eventually make 
progress in an area that has too long been dominated by ever increasing 
fractionization by factor analysis, with little thought as to how the frac- 
tured parts get put together into a functioning whole. 

Abilities or capacities or aptitudes or intellectual skills or whatever 
you choose to call them, are measured in terms of response products 
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to standardized stimulus situations. The stimulus is presented to an 
organism which by some process comes up with a response; thus any 
attempt to theorize and/or study intellect in terms of a simple stimulus- 
response (S-R) paradigm seems doomed to failure unless drastically 
modified and complicated by the insertion of O for organism and P for 
process. | . 

There have been thousands of researches on the multitudinous varia- 
tions from organism to organism, and the results fill books on individual 
differences. These studies can be roughly classified into two types. First, 
those that ascertain the intercorrelations among scaled response prod- 
ucts to various stimulus situations, known as tests, have to do with the 
structure of intellect; and whether the resulting factors are anything 
more than dimensions for describing individual differences need not 
concern us here. The second type of study seeks the nontest correlates 
of test performance, and whether or not any of the found correlates can 
be regarded as explaining individual differences is not of interest here. 
Both types of studies certainly force one to stress the overwhelming 
diversity exhibited among the organisms. 

But these studies of individual differences never come to grips with the 
process, or operation, by which a given organism achieves an intellectual 
response. Indeed, it is difficult to see how the available individual dif- 
ference data can be used even as a starting point for generating a theory 
as to the process nature of general intelligence or of any other Specified 
ability. 

As a basis for a little speculation, let us conceive of a highly hypotheti- 
cal situation in which the two members of а pair of identical twins, with 
identical experiences, find themselves cast up on an uninhabited tropical 
island. Let us assume that they are at the supergenius level, far beyond 
that of your favorite man of genius. Let us also assume that, though 
highly educated in the sciences, they have been fortunate enough to have 
had zero exposure to psychology. In addition, we presume that, being 
highly involved and abstracted in the pursuit of science, they have never 
noticed what we call individual differences in abilities. 

A quick exploration of the island assures them that food is plentiful, 
that shelter is available, and that clothing is not a necessity. To allay 
the boredom that they foresee as an eternity in this laborless heaven, 
they decide to spend their time in the further pursuit of science, but the 
lack of the wherewithal for constructing gadgets rules out any research 
in the physical sciences. Having had a college course in Bugs and Bites, 
they proceed to study the life of the island’s insects, then the habits of 
the birds, and the antics of a couple of monkeys. The manner in which the 
monkeys adjust to the environment leads them to set up some trial situa- 
tions for more systematic observation. Needless to say, the monkeys 
show evidence of what we call learning and what we call problem solving. 

Eventually they decide that attempting to outwit each other might 
be more fun than being outwitted by the monkeys, so they begin to cook 
up and use games and problems for this purpose. This activity leads each 
to speculate and introspect about how problems are invented and how 
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solved. Then by cleverly designed experiments, preceded of course by 
theory, they set forth highly developed laws and principles about what 
we call reasoning and problem solving. Incidentally, they switch back 
and forth between the roles of experimenter and subject, there being 
no college sophomores available. They continue for years the study of 
their own mental operations, constantly on the alert for new phenomena 
to investigate. 

And now with apologies to the ancient Greeks, who did have some 
ideas along these lines, we leave with you the 64-million drachma ques- 
tion: Will our two identical supergeniuses, being totally unaware of indi- 
vidual differences, ever hit upon and develop a concept of intelligence? 
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NOTES 


1. Since this was written, the replication study of Yamamoto (1964) gives data that 
corroborate this deduction. 


TESTING FOR COMPETENCE 
RATHER THAN FOR “INTELLIGENCE” 


David С. McClelland, Harvard University 


The testing movement in the United States has been a success, if one 
judges success by the usual American criteria of size, influence, and 
profitability. Intelligence and aptitude tests are used nearly everywhere 
by schools, colleges, and employers. It is a sign of backwardness not to 
have test scores in the school records of children. The Educational Test- 
ing Service alone employs about 2,000 people, annually administers 
Scholastic Aptitude Tests to thousands of aspirants to college, and makes 
enough money to support a large basic research operation. Its tests have 
tremendous power over the lives of young people by stamping some of 
them “qualified” and others “less qualified” for college work. Until recent 
exceptions were made (over the protest of some), the tests have served 
as a very efficient device for Screening out black, Spanish-speaking and 
other minority applicants to colleges. Admissions officers have protested 
that they take other qualities besides test achievements into account in 
granting admission, but careful studies by Wing and Wallach (1971) and 
others have shown that this is true only to a very limited degree, 
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Why should intelligence or aptitude tests have all this power? What 
justifies the use of such tests in selecting applicants for college entrance 
or jobs? On what assumptions is the success of the movement based? 
They deserve careful examination before we go on rather blindly pro- 
moting the use of tests as instruments of power over the lives of many 
Americans. 

The key issue is obviously the validity of so-called intelligence tests. 
Their use could not be justified unless they were valid, and it is my con- 
viction that the evidence for their validity is by no means so overwhelm- 
ing as most of us rather unthinkingly had come to think it was. In point of 
fact, most of us just believed the results that the testers gave us, without 
subjecting them to the kind of fierce skepticism that greets, for example, 
the latest attempt to show that ESP exists. My objectives are to review 
skeptically the main lines of evidence for the validity of intelligence and 
aptitude tests and to draw some inferences from this review as to new 
lines that testing might take in the future. 

Let us grant at the outset that brain-damaged or retarded people do 
less well on intelligence tests than other people. Wechsler (1958) initially 
used this criterion to validate his instrument, although it has an obvious 
weakness: brain-damaged people do less well on almost any test so that 
it is hard to argue that something unique called lack of intelligence is 
responsible for the deficiency in test scores. The multimethod, multitrait 
criterion has not been applied here. 


TESTS PREDICT GRADES IN SCHOOL 


The games people are required to play on aptitude tests are similar 
to the games teachers require in the classroom. In fact, many of Binet's 
original tests were taken from exercises that teachers used in French 
Schools. So it is scarcely surprising that aptitude test scores are corre- 
lated highly with grades in school. The whole Scholastic Aptitude Testing 
movement rests its case largely on this single undeniable fact. Defenders 
of intelligence testing, like MeNemar (1964), often seem to be suggesting 
that this is the only kind of validity necessary. МеМетаг remarked that 
"the manual of the Differential Aptitude Test of the Psychological Cor- 
poration contains a staggering total of 4,096—yes, I counted ’em—valid- 
ity coefficients." What more could you ask for, ladies and gentlemen? It 
was not until I looked at the manual myself (MeNemar certainly did not 
enlighten me) that I confirmed my suspicion that almost every one of 
those validity coefficients involved predicting grades in courses—in 
other words, performing on similar types of tests. 

So what about grades? How valid are they as predictors? Researchers 
have, in fact, had great difficulty demonstrating that grades in school 
are related to any other behaviors of importance—other than doing well 
on aptitude tests. Yet the general publie—including many psychologists 
and most college officials—simply has been unable to believe or accept 
this fact. It seems so self-evident to educators that those who do well in 
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their classes must go on to do better in life that they systematically have 
disregarded evidence to the contrary that has been accumulating for 
some time. In the early 1950s a committee of the Social Science Research 
Council of which I was chairman looked into the matter and concluded 
that while grade level attained seemed related to future measures of 
success in life, performance within grade was related only slightly. In 
other words, being a high school or college graduate gave one acredential 
that opened up certain higher level jobs, but the poorer students in high 
school or college did as well in life as the top students. As a college teacher 
I found this hard to believe until I made a simple check. I took the top 
eight students in a class in the late 1940s at Wesleyan University, where 
I was teaching—all straight-A students—and contrasted what they were 
doing in the early 1960s with what eight really poor students were do- 
ing—all of whom were getting barely passing averages in college (C— or 
below). To my great surprise, I could not distinguish the two lists of 
men 15-18 years later. There were lawyers, doctors, research scientists, 
and college and high school teachers in both groups. The only difference 
I noted was that those with better grades got into better law or medical 
schools, but even with this supposed advantage they did not have notably 
more successful careers as compared with the poorer students who had 
had to be satisfied with second-rate law and medical schools at the outset. 
Doubtless the С— students could not get into even second-rate law and 
medical schools under the stricter admissions testing standards of today. 
Is that an advantage for society? 
Such outcomes have been documented carefully by many researchers 
(cf. Hoyt, 1965) both in Britain (Hudson, 1960) and in the United States. 
Berg (1970) in a book suggestively titled Education and Jobs: The Great 
Training Robbery has summarized studies showing that neither amount 
of education nor grades in school are related to vocational success as à 
factory worker, bank teller, or air traffic controller. Even for highly 
intellectual jobs like scientific researcher, Taylor, Smith, and Ghiselin 
(1963) have shown that superior on-the-job performance is related in no 
way to better grades in college. The average college grade for the top 
third in research success was 2.78 (about B—) and for the bottom third, 
2.69 (also B—). Such facts have been known for some time. They make 
it abundantly clear that the testing movement is in grave danger of per- 
petuating a mythological meritocracy in which none of the measures of 
merit bears significant demonstrable validity with respect to any mea- 
sures outside of the charmed circle. Psychologists used to say as a kin 
of an “in” joke that intelligence is what the intelligence tests measure. 
That seems to be uncomfortably near the whole truth and nothing but 
the truth. But what's funny about it, when the publie took us more seri- 
ously than we did ourselves and used the tests to screen people out of 
opportunities for education and high-status jobs? And why call excellence 
at these test games intelligence? 

Even further, why keep the best education for those who are already 
doing well at the games? This, in effect, is what the colleges are doing 
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when they select from their applicants those with the highest Scholastic 
Aptitude Test scores. Isn’t this like saying that we will coach especially 
those who already can play tennis well? One would think that the purpose 
of education is precisely to improve the performance of those who are not 
doing very well. So when psychologists predict on the basis of the Scho- 
lastic Aptitude Test who is most likely to do well in college, they are 
suggesting implicitly that these are the best bets to admit. But in another 
sense, if the colleges were interested in proving that they could educate 
people, high-scoring students might be poor bets because they would be 
less likely to show improvement in performance. To be sure, the teachers 
want students who will do well in their courses, but should society allow 
the teachers to determine who deserves to be educated, particularly 
when the performance of interest to teachers bears so little relation to 
any other type of life performance? 


DO INTELLIGENCE TESTS TAP ABILITIES THAT 
ARE RESPONSIBLE FOR JOB SUCCESS? 


Most psychologists think so; certainly the general publie thinks so 
(Cronbach, 1970, p. 300), but the evidence is a whole lot less satisfactory 
than one would think it ought to be to justify such confidence. 

Thorndike and Hagen (1959), for instance, obtained 12,000 correlations 
between aptitude test scores and various measures of later occupational 
success on over 10,000 respondents and concluded that the number of 
significant correlations did not exceed what would be expected by chance. 
In other words, the tests were invalid. Yet psychologists go on using 
them, trusting that the poor validities must be due to restriction in range 
due to the fact that occupations do not admit individuals with lower 
Scores. But even here it is not clear whether the characteristics required 
for entry are, in fact, essential to success in the field. One might suppose 
that finger dexterity is essential to being a dentist and require a minimum 
test score for entry. Yet it was found by Thorndike and Hagen (1959) to 
be related negatively to income as a dentist! Holland and Richards (1965) 
and Elton and Shevel (1969) have shown that no consistent relationships 
exist between scholastie aptitude scores in college students and their 
actual accomplishments in social leadership, the arts, science, music, 
writing, and speech and drama. 

Yet what are we to make of Ghiselli's (1966, p. 121) conclusions, based 
on a review of 50 years of research, that general intelligence tests corre- 
late .42 with trainability and .23 with proficiency across all types of jobs? 
Each of these correlations is based on over 10,000 cases. It is small won- 
der that psychologists believe intelligence tests are valid predictors of 
job success. Unfortunately, it is impossible to evaluate Ghiselli’s conclu- 
sion, as he does not cite his sources and he does not state exaetly how job 
proficiency was measured for each of his correlations. We can draw some 
conclusions from his results, however, and we can make a good guess that 
job proficiency often was measured by supervisors’ ratings or by such 
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reported a correlation of .27 between intelligence test scores and profi- 
ciency as a policeman or a detective (p. 83), with no attention given to 
the very important issues involved in how a policeman’s performance is to 
be evaluated. Will supervisors’ ratings do? If so, it discriminates against 
black policemen (Baehr, Furcon, & Froemel, 1968) because white super- 
visors regard them as inferior. And what about the public? Shouldn’t 
their opinion as to how they are served by the police be part of the cri- 
terion? The most recent careful review (Kent & Eisenberg, 1972) of the 
evidence relating ability test scores to police performance concluded 
that there is no stable, significant relationship. Here is concrete evidence 
that one must view with considerable skepticism the assumed relation 
of intelligence test scores to success on the job. 

One other illustration may serve to warn the unwary about accepting 
uncritically simple statements about the role of ability, as measured by 
intelligence tests, in life outcomes. It is stated widely that intelligence 
promotes general adjustment and results in lower neuroticism. For 
example, Anderson (1960) reported a significant correlation between 
intelligence test scores obtained from boys in 1950, age 14-17, and follow- 
up ratings of general adjustment made five years later. Can we assume 
that intelligence promotes better adjustment to life as has been often 
claimed? It sounds reasonable until we reflect that the “intelligence” test 
is a test of ability to do well in school (to take academic-type tests), that 
many of Anderson’s sample were still in school or getting started on 
careers, and that those who are not doing well in school or getting a good 
first job because of it are likely to be considered poorly adjusted by them- 
selves and others. Here the test has become part of the criterion and has 
introduced the correlation artificially. In case this sounds like special 
reasoning, consider the fact, not commented on particularly by Anderson, 
that the same correlation between intelligence test scores and adjust- 
ment in girls was an insignificant .06. Are we to conclude that intelligence 
does not promote adjustment in girls? It would seem more reasonable to 
argue that the particular ability tested, here associated with scholastic 
success, is more important to success (and hence adjustment) for boys 
than for girls. But this is a far ery from the careless inference that intel- 
ligence tests tap a general ability to adapt successfully to life’s problems 
because high-IQ children (read “теп”) have better mental health (Jen- 
sen, 1972). 

To make the point even more vividly, suppose you are a ghetto resi- 
dent in the Roxbury section of Boston. To qualify for being a policeman 
you have to take a three-hour-long general intelligence test in which you 
must know the meaning of words like “quell,” “pyromaniac,” and “lexi- 
con.” If you do not know enough of those words or cannot play analogy 
games with them, you do not qualify and must be satisfied with some 
such job as being a janitor for which an intelligence test is not required 
yet by the Massachusetts Civil Service Commission. You, not unreason- 
ably, feel angry, upset, and unsuccessful. Because you do not know those 
words, you are considered to have low intelligence, and since you con- 
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ve to take a low-status job and are unhappy, you contribute 
мүн. one correlations of low intelligence with low л 
status and poor adjustment. Psychologists should be ashamed of t T 
selves for promoting a view of general intelligence that has encouraged 
such a testing program, particularly when there is no solid evidence that 


significantly relates performance on this type of intelligence test with 
performance as a policeman. 
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them with (at one point all men in California, including day laborers). 
He had no unequivocal evidence that it was “giftedness” (as reflected in 
his test scores) that was responsible for the superior performance of 
his group. It would be as legitimate (though also not proven) to conclude 
that sons of the rich, powerful, and educated were apt to be more success- 
ful occupationally, maritally, and socially because they had more material 
advantages. To make the point in another way, consider the data in 
Table 1, which are fairly representative of findings in this area. They 
were obtained by Havighurst, Bowman, Liddle, Matthews, and Pierce 
(1962) from a typical town in Middle America. One observes the usual 
strong relationship between social class and IQ and between IQ and 
college-going—which leads on to occupational success. The traditional 
interpretation of such findings is that more stupid children come from 
the lower classes because their parents are also stupid which explains 
why they are lower class. A higher proportion of children with high IQ 
go to college because they are more intelligent and more suited to college 
study. This is as it should be because IQ predicts academic success. The 
fact that more intelligent people going to college come more often from 
the upper class follows naturally because the upper classes contain more 
intelligent people. So the traditional argument has gone for years. It 
seemed all very simple and obvious to Terman and his followers. 


Table 1 


Numbers of Students in Various © and SES Categories (Sixth Grade) 
and Percentage Subsequently Going to College 


10 SOCIOECONOMIC STATUS 
High % to College Low % to College 
High 51 71 57 23 
Low 33 18 96 5 


Note, x? = 11.99, р < .01, estimated tetrachorie r = .35, SES х IQ. (Table adapted from 
Havighurst et al., 1962. Copyrighted by Wiley, 1962.) 


However, a closer look at Table 1 suggests another interpretation that 
is equally plausible, though not more required by the data than the one 
just given. Compare the percentages going to college in the “deviant” 
boxes—high socioeconomic status and low IQ versus high IQ and low 
socioeconomic status. It appears to be no more likely for the bright 
children (high IQ) from the lower classes to go to college (despite their 
high aptitude for it) than for the “stupid” children from the upper classes. 
Why is this? An obvious possibility is that the bright but poor children 
do not have the money to go to college, or they do not want to go, pre- 
ferring to work or do other things. In the current lingo, they are “disad- 
vantaged” in the sense that they have not had access to the other factors 
(values, aspirations, money) that promote college-going in upper-class 
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children. But now we have an alternative explanation of college-going— 
namely, socioeconomic status, which seems to be as good a predictor of 
this type of success as ability. How can we claim that ability as measured 
by these tests is the critical factor in college-going? Very few children, 
even with good test-taking ability, go to college if they are from poor 
families. One could argue that they are victims of oppression: they do 
not have the opportunity or the values that permit or encourage going 
to college. Isn’t it likely that the same oppressive forces may have pre- 
vented even more of them from learning to play school games well at all? 

Belonging to the power elite (high socioeconomic status) not only helps 
a young man go to college and get jobs through contacts his family has; 
it also gives him easy access as a child to the credentials that permit him 
to get into certain occupations. Nowadays, those credentials include the 
words and word-game skills used in Scholastic Aptitude Tests. In the 
Middle Ages they required knowledge of Latin for the learned profes- 
sions of law, medicine, and theology. Only those young men who could 
read and write Latin could get into those occupations, and if tests had 


been given in Latin, Iam sure they would have shown that professionals 
scored higher in Latin than men in general, that sons who grew up in 
families where Latin was 


used would have an advantage in those tests 
compared to those in poor families where Latin was unknown, and that 
these men were more likely to get into the professions. But would we 
conclude we were dealing with a general ability factor? Many a ghetto 
resident must or should feel that he is in a similar position with regard 
to the kind of English he must learn in order to do well on tests, in school, 
and in occupations today in America. I was recently in Jamaica, where 
all around me poor people were Speaking an English that was almost 


entirely incomprehensible to me. If I insisted, they would speak patiently 
in a way that I could understand, but I 
have wondered ho 


comprehension and so ignorant of ordinary vocabulary. 
When Cronbach (1970) concluded 


information on the presence of a han 


nce tests are invalid? As so often when 
у in psychology, the answer depends on 
what you mean. Valid for what? Certainly they are valid for predicting 
who will get ahead in a number of prestige jobs where credentials are 
important. So is white skin: it too is a valid predictor of job success in 
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prestige jobs. But no one would argue that white skin per se is an ability 
factor. Lots of the celebrated correlations between so-called intelligence 
test scores and success can lay no greater claim to representing an ability 
factor. 

Valid for predicting success in school? Certainly, because school suc- 
cess depends on taking similar types of tests. Yet neither the tests nor 
school grades seem to have much power to predict real competence in 
many life outcomes, aside from the advantages that credentials convey 
on the individuals concerned. 

Are there no studies which show that general intelligence test scores 
predict competence with all of these other factors controlled? I can only 
assert that I have had a very hard time finding a good carefully controlled 
study of the problem because testers simply have not worked very hard 
on it: they have believed so much that they were measuring true compe- 
tence that they have not bothered to try to prove that they were. Studies 
do exist, of course, which show significant positive correlations between 
special test scores and job-related skills. For example, perceptual speed 
scores are related to clerical proficiency. So are tests of vocabulary, 
immediate memory, substitution, and arithmetic. Motor ability test 
scores are related to proficiency as a vehicle operator (Ghiselli, 1966). 
And so on. Here we are on the safe and uncontroversial ground of using 
tests as criterion samples. But this is a far ery from inferring that there 
is a general ability factor that enables a person to be more competent in 
anything he tries. The evidence for this general ability factor turns out 
to be contaminated heavily by the power of those at the top of the social 
hierarchy to insist that the skills they have are the ones that indicate 
superior adaptive capacity. 


WHERE DO WE GO FROM HERE? 


Criticisms of the testing movement are not new. The Social Science 
Research Council Committee on Early Identification of Talent made 
some of these same points nearly 15 years ago (McClelland, Baldwin, 
Bronfenbrenner, & Strodtbeck, 1958). But the beliefs on which the 
movement is based are held so firmly that such theoretical or empirical 
objections have had little impact up to now. The testing movement 
continues to grow and extend into every corner of our society. It is un- 
likely that it can be simply stopped, although minority groups may have 
the political power to stop it. For the tests are clearly discriminatory 
against those who have not been exposed to the culture, entrance to 
which is guarded by the tests. What hopefully can happen is that testers 
will recognize what is going on and attempt to redirect their energies in 
a sounder direction. The report of the special committee on testing to 
the College Entrance Examination Board (1970) is an important sign that 
changes in thinking are occurring—if only they can be implemented at 
a practical level. The report's gist is that a wider array of talents should 
be assessed for college entrance and reported as a profile to the colleges. 
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This is a step in the right direction if everyone keeps firmly in mind that 
the criteria for establishing the validity of these new measures really 
ought to be not grades in school but grades in life in the broadest theo- 
retical and practical sense. 

But now I am on the spot. Having criticized what the testing move- 
ment has been doing, I feel some obligation to suggest alternatives. How 
would I do things differently or better? I do not mind making sugges- 
tions, but I am well aware that some of them are as open to criticism on 
other grounds as the procedures I have been criticizing. So I must offer 
them in a spirit of considerable humility, as approaches that at least 
some people might be interested in pursuing who are discouraged with 
what we have been doing. My goal is to brainstorm a bit on how things 
might be different, not to present hard evidence that my proposals are 
better than what has been done to date. How would one test for com- 
petence, if I may use that word as a symbol for an alternative approach 
to traditional intelligence testing? 

1. The best testing is criterion sampling. The point is so obvious that 
it would scarcely be worth mentioning if it had not been obscured so often 
by psychologists like MeNemar and Jensen, who tout a general intelli- 
gence factor. If you want to know how well а person can drive a car (the 
criterion), sample his ability to do so by giving him a driver's test. Do not 
give him a paper-and-pencil test for following directions, a general intelli- 
gence test, ete. As noted above, there is ample evidence that tests which 
sample job skills will predict proficiency on the job. 

Academic skill tests are successful precisely because they involve cri- 
terion sampling for the most part. As already pointed out, the Scholastic 
Aptitude Test taps skills that the teacher is looking for and will give high 
grades for. No one could object if it had been recognized widely that this 
was all that was going on when aptitude tests were used to predict who 


would do well in school. Trouble started only when people assumed that 


these skills had some more general validity, as implied in the use of words 


like “intelligence.” Yet even a little criterion analysis would show that 
there are almost no occupations or life situations that require a person to 
do word analogies, choose the most correct of four alternative meanings 
of a word, ete. 

Criterion sampling means that testers have got to get out of their 
offices where they play endless word and paper-and-pencil games and 
into the field where they actually analyze performance into its compo- 
nents. If you want to test who will be a good policeman, go find out what 
a policeman does. Follow him around, make a list of his activities, and 
sample from that list in screening applicants. Some of the job sampling 
will have to be based on theory as well as practice. If policemen generally 
discriminate against blacks, that is clearly not part of the criterion be- 
cause the law says that they must not. So include a test which shows the 
applicant does not discriminate. Also sample the vocabulary he must use 
to communicate with the people he serves since his is a position of inter- 
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personal influence—and not the vocabulary that men who have never 
been on a police beat think it is proper to know. And do not rely on super- 
visors’ judgments of who are the better policemen because that is not, 
strictly speaking, job analysis but analysis of what people think involves 
better performance. Baehr et al. (1968), for instance, found that black 
policemen in Chicago who were rated high by their superiors scored high 
on the Deference scale of the Edwards Personal Preference Test. No 
such relationship appeared for white policemen. In other words, if you 
wanted to be considered a good cop in Chicago and you were black, you 
had to at least talk as if you were deferent to the white power system. 
Any psychologist who used this finding to pick black policemen would 
be guilty of improper job analysis, to put it as mildly as possible. 

Criterion sampling, in short, involves both theory and practice. It 
requires real sophistication. Early testers knew how to do it better than 
later testers because they had not become so caught up in the ingrown 
world of intelligence tests that simply were validated against each other. 
Testers of the future must relearn how to do criterion sampling. If some- 
one wants to know who will make a good teacher, they will have to get 
videotapes of classrooms, as Kounin (1970) did and find out how the be- 
haviors of good and poor teachers differ. To pick future businessmen, 
research scientists, political leaders, prospects for a happy marriage, 
they will have to make careful behavioral analyses of these outcomes and 
then find ways of sampling the adaptive behavior in advance. The task 
will not be easy. It will require new psychological skills not ordinarily 
in the repertoire of the traditional tester. What is called for is nothing 
less than a revision of the role itself—moving it away from word games 
and statistics toward behavioral analysis. 

2. Tests should be designed to reflect changes in what the individual 
has learned. It is difficult, if not impossible, to find a human characteris- 
tic that cannot be modified by training or experience, whether it be an 
eye blink or copying Kohs’ block designs. To the traditional intelligence 
tester this fact has been something of a nuisance because he has been 
searching for some unmodifiable, unfakeable index of innate mental 
capacity. He has reacted by trying to keep secret the way his tests are 
scored so that people will not learn how to do them better, and by select- 
ing tests, scores on which are stable from one administration to the next. 
Stability is supposed to mean that the score reflects an innate aptitude 
that is unmodified by experience, but it could also mean that the test is 
simply insensitive to important changes in what the person knows or 
can do. That is, the skill involved may be so specialized, so unrelated to 
general experience, that even though the person has learned a lot, he 
performs the same in this specialized area. For example, being able to 
play a word game like analogies is apparently little affected by a higher 
education, which is not so surprising since few teachers ask their stu- 
dents to do analogies. Therefore, being able to do analogies is often 
considered a sign of some innate ability factor. Rather, it might be called 
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an achievement so specialized that increases in general wisdom do not 
transfer to it and cause changes in it. And why should we be interested 
in such specialized skills? As we have seen, they predictably do not seem 
to correlate with any life-outcome criteria except those that involve sim- 
ilar tests or that require the credentials that a high score on the test 
signifies. 

It seems wiser to abandon the search for pure ability factors and to 
select tests instead that are valid in the sense that scores on them change 
as the person grows in experience, wisdom, and ability to perform effec- 
tively on various tasks that life presents to him. Thus the second principle 
of the new approach to testing becomes a corollary of the first. If one 
begins by using as tests samples of life-outcome behaviors, then one way 
of determining whether those tests are valid is to observe that the per- 
son’s ability to perform them increases as his competence in the life- 
outcome behavior increases. For example, if excellence in a policeman 
is defined partly in terms of being even-handed toward all minority groups, 
then a test of fair-mindedness (or lack of ethnocentrism) might be used 
to select policemen and also should reflect growth in fair-mindedness as a 
police recruit develops on the job. One of the hidden prejudices of psy- 
chology, borrowed from the notion of fixed inherited aptitudes, is that 
any trait, like racial prejudice, is unmodifiable by training. Once a bigot, 
always a bigot. There is no solid evidence that this trait or any other 
human trait cannot be changed. So it is worth insisting that a new test 
should be designed especially to reflect growth in the characteristic it 
assesses. 

3. How to improve on the characteristic tested should be made public 
and explicit. Such a principle contrasts sharply with present practice 
in which psychologists have tried hard—backed up by the APA Ethics 
Committee—to keep answers to many of their tests a secret lest people 
practice and learn how to do better on them or fake high scores. Faking 
a high score is impossible if you are performing the criterion behavior, 
as in tests for reading, spelling, or driving a саг. Faking becomes possible 
the more indirect the connection is between the test behavior and the 
criterion behavior. For example, in checking out hundreds of items for 
predicting flight training success, it may turn out that something like 
playing the piano as a boy has diagnostic validity. But no one knows 
exactly why: perhaps it has something to do with mechanical ability, 
perhaps with a social class variable, or with conscientiousness in prac- 
ticing. The old-fashioned tester could not care less what the reason was 
as long as the item worked. But he had to be very careful about security 
because men who wanted to become pilots easily could report they had 
played the piano if they knew such an answer would help them be se- 
lected. If playing the piano actually helped people become better pilots— 
which no psychologist bothered to check out in World War II—then it 
might make some sense to make this known and encourage applicants 
to learn to play. That would be very like the criterion-sampling approach 
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to testing proposed here, in which the person tested is told how to im- 
prove on the characteristic for which he will be tested. 

Or to take another example, doing analogies is a task that predicts 
grades in school fairly well. Again no one knows quite why because school- 
work ordinarily does not involve doing analogies. So psychologists have 
had to be security conscious for fear that if students got hold of the anal- 
ogies test answers, they might practice and become good at analogies and 
fake high aptitude. What is meant by faking here is that doing well on 
analogies is not part of the criterion behavior (getting good grades), or 
else it could hardly be considered faking. Rather, the test must have 
some indirect connection with good grades, so that doing well on it through 
practice destroys its predictive power: hence the high score is a fake. 
The person can do analogies, but that does not mean any longer that he 
will get better grades. Put this way, the whole procedure seems like a 
strange charade that testers have engaged in because they did not know 
what was going on, behaviorally speaking, and refused to take the trou- 
ble to find out as long as the items worked. How much simpler it is, both 
theoretically and pragmatically, to make explicit to the learner what the 
criterion behavior is that will be tested. Then psychologist, teacher, and 
student can collaborate openly in trying to improve the student’s score 
on the performance test. Certain school achievement tests, of course, 
follow this model. In the Iowa Test of Basic Skills, for instance, both pupil 
and teacher know how the pupil will be tested on spelling, reading, or 
arithmetic, how he should prepare for the test, how the tests will be 
scored, ete. What is proposed here is that all tests should follow this 
model. To do otherwise is to engage in power games with applicants over 
the secrecy of answers and to pretend knowledge of what lies behind 
correlations, which does not in fact exist. 

4. Tests should assess competencies involved in clusters of life out- 
comes. If we abandon general intelligence or aptitude tests, as proposed, 
and move toward criterion sampling based on job analysis, there is the 
danger that the tests will become extremely specific to the criterion 
involved. For example, Project ABLE (Gagné, 1965) has identified over 
50 separate skills that can be assessed for the exit level of millman ap- 
prentice (job family: woodworker and related occupations). They include 
skills like “measures angles,” “sharpens tools and planes,” and “identifies 
sizes and types of fasteners using gauges and charts.” This approach has 
all of the characteristics of the new look in testing so far proposed: the 
tests are criterion samples; improvement in skill shows up in the tests; 
how to pass them is public knowledge; and both teacher and pupil can 
collaborate to improve test performance. However, what one ends up 
with is hundreds, even thousands, of specific tests for dozens of different 
occupations. For some purposes it may be desirable to assess competen- 
cies that are more generally useful in clusters of life outcomes, including 
not only occupational outcomes but social ones as well, such as leadership, 
interpersonal skills, ete. Project ABLE has been excellent at identifying 
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the manual skills involved in being a service station attendant, but so far 
it has been unable to get a simple index of whether or not the attendant 
is pleasant to the customers. | 0 

Some of these competencies may be rather traditional cognitive ones 
involving reading, writing, and calculating skills. Others should involve 
what traditionally have been called personality variables, although they 
might better be considered competencies. Let me give some illustrations. 

(a) Communication skills. Many jobs and most interpersonal situa- 
tions require a person to be able to communicate accurately by word, 
look, or gesture just what he intends or what he wants done. Writing is 
one simple way to test these skills. Can the person put together words 
in a way that makes immediate good sense to the reader? Word-game 
skills do not always predict this ability, as is often assumed. I will never 
forget an instance of a black student applicant for graduate school at 
Harvard who scored in something like the fifth percentile in the Miller 
Analogies Test, but who obviously could write and think clearly and 
effectively as shown by the stories he had written as a reporter in the 
college paper. I could not convince my colleagues to admit him despite 
the fact that he had shown the criterion behavior the Analogies Test is 
supposed to predict. Yet if he were admitted, as a psychologist he would 
be writing papers in the future, not doing analogies for his colleagues. 
It is amazing to me how often my colleagues say things like: “I don’t care 
how well he can write. Just look at those test scores.” Testers may shud- 
der at this and write public disclaimers, but what practically have they 
done to stop the spread of this blind faith in test scores? 

In Ethiopia in 1968 we were faced with the problem of trying to find 
out how much English had been learned by high school students who had 
been taught by American Peace Corps volunteers. The usual way of 
doing this there, as elsewhere, is to give the student a “fill in the blanks,” 
multiple-choice objective test to see whether the student knows the 
meaning of words, understands correct grammatical forms, etc. We felt 
that this left out the most important part of the criterion behavior: the 
ability to use English to communicate. So we asked students to write 
brief stories, which we then coded objectively, not for grammatical or 
spelling correctness, but for complexity of thought which the student was 
able to express correctly in the time allotted. This gave a measure of 
English fluency that predictably did correlate with occupational success 
among Ethiopian adults and also with school success, although curiously 
enough it was significantly negatively related to a word-game skill (En- 


glish antonyms) that more nearly approximates the usual test of English 
competence (Bergthold, 1969). 
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stance, has found a way to measure success at enacting certain emotions 
so that others receive them correctly and to measure success at receiving 
the correct emotions over various enactors. He found that black boys at 
a certain age were consistently better than white boys at this particular 
kind of communication skill, which is a far more crucial type of criterion 
behavior than most paper-and-pencil tests sample. 

(b) Patience, or response delay as psychologists would call it, is a hu- 
man characteristic that seems essential for many life outcomes. For 
instance, it is desirable for many service occupations where clients’ needs 
and demands can be irritating. It would seem particularly desirable in a 
policeman who has the power and authority to do great damage to people 
who irritate him. Kagan, Pearson, and Welch (1966) have shown that it 
is an easily measured human characteristic that is relatively stable over 
time and can be taught directly. 

(c) Moderate goal setting is important in achievement-related games, 
as I have explained fully elsewhere (McClelland, 1961). In most life situa- 
tions, it is distinctly preferable to setting goals either too high or too low, 
which leads more often to failure. Many performance situations have 
been devised which measure the tendency to set moderate, achievable 
goals and help the person learn how to set more realistic goals in the 
future (Alschuler, Tabor, & McIntyre, 1970; McClelland & Winter, 1969). 

(d) Ego development. Many scholars (see Erikson, 1950; Loevinger 
& Wessler, 1970; White, 1959) have reasoned that there is a general kind 
of competence which develops with age and to a higher level in some 
people than in others. Costa (1971a) recently has developed a Thematic 
Apperception Test code for ego development which appears to have many 
of the aspects sought in the new measurement direction proposed here. 
The thought characteristics sampled represent criterion behavior in the 
sense that at Stage 1, for example, the person is thinking at a passive 
conformist level, whereas at Stage 4, he represents people in his stories 
as taking initiative on behalf of others (a more developed competency). 
The score on this measure predicts very well which junior or high school 
students will be perceived by their teachers as more competent (even 
when correlations with intelligence and grade performance are removed), 
and furthermore a special kind of education in junior high school moves 
students up the ego development scale significantly. That is, training 
designed to develop a sense of initiative produced results that were re- 
flected sensitively in this score. Pupils and teachers can collaborate in 
increasing this kind of thinking which ought to prepare students for 
competent action in many spheres of life. 

5. Tests should involve operant as well as respondent behavior. One 
of the greatest weaknesses of nearly all existing tests is that they struc- 
ture the situation in advance and demand a response of a certain kind 
from the test taker. They are aimed at assessing the capacity of a person 
to make a certain kind of response or choice. But life outside of tests 
seldom presents the individual with such clearly defined alternatives as 
“Which dog is most likely to bite?” or “Complete the following number 
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series: 13 6 10 15 __,” or “Check the word which is most similar in mean- 
ing to lewicon.” If we refer to these latter behaviors as respondents in 
the sense that the stimulus situation clearly is designed to evoke a par- 
tieular kind of response, then life is much more apt to be characterized 
by operant responses in the sense that the individual spontaneously 
makes a response in the absence of a very clearly defined stimulus. This 
fact probably explains why most existing tests do not predict life-outcome 
behaviors. Respondents generally do not predict operants. To use a crude 
example, a psychologist might assess individual differences in the capac- 
ity to drink beer, but if he used this measure to predict actual beer con- 
sumption over time, the chances are that the relationship would be very 
low. How much beer a person can drink is not related closely to how much 
he does drink. 4 
Testers generally have used respondent behaviors to save time їп scor- 
ing answers and to get higher test-retest reliability. That is, the person 
is more likely to give the same response in a highly structured situation 
than in an unstructured one that allows him to emit any behavior. Yet 
slavishly pursuing these goals has led to important lacks in validity of 
the tests because life simply is not that structured and often does not 
permit one to choose between defined-in-advance responses. The n 
Achievement measure, which is an operant in the sense that the subject 
emits responses (tells stories) under only very vague instructions, has 
predicted over a 12-14-year period in three different samples those who 
will drift into entrepreneurial business occupations (McClelland, 1965). 
Here an operant is predicting an operant—the tendency to think spon- 
taneously about doing better all the time predicts a series of spontaneous 
acts over time which leads the individual into an entrepreneurial occupa- 
tion. But predicting from operants to respondents or vice versa does not 
work, at least for men (McClelland, 1966). The n Achievement score is 
not related to grades or academic test scores (respondent measures), 
nor do grades relate to entering entrepreneurial occupations (see Mc- 
Clelland, 1961). 
Even within fairly structured test situations it is possible to allow for 
more operant behavior than has been the usual practice. Not long ago we 
tried to find an existing performance test on which a person with high n 
Achievement ought to do well because such a test might be a useful sub- 
stitute for the Thematic Apperception Test storytelling measure in 
certain situations. Theoretically, such a test should permit operant 
behavior in which the individual generates a lot of alternatives for solving 
а problem in search of the most efficient solution. But to our surprise, 
We could find no such test. Tests of divergent thinking existed that 
counted the number of operants (e.g., original uses for a paper clip) an 
individual could come up with, but they did not require the person to 
find the best alternative. Most other tests simply required the person to 
find the one correct answer the test maker had built into the item. What 
was needed were test items to which there were many correct answers, 
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among which one was better than others in terms of some criteria of 
efficiency that the person would have to apply. This task seemed more 
lifelike to us and certainly more like the type of behavior characteristic 
of people with high n Achievement. So we invented an Airlines Schedul- 
ing Test (Bergthold, 1969), in which the person is faced with a number 
of problems of getting a passenger from City A to City B by such and 
such a time at minimum expenditure in time, energy, money, and dis- 
comfort. From schedules provided, several alternative routes and con- 
nections can be generated (if the test taker is energetic enough to think 
them up) that will solve the problem, but one is clearly the most efficient. 
The test has promise in that it correlates with the n Achievement score 
at a low level. But the main point is that it requires more lifelike operant 
behavior in generating alternative solutions, and therefore it should have 
more predictive power to a variety of situations in which what the person 
is expected to do is not so highly structured as in standard respondent 
tests. 

6. Tests should sample operant thought patterns to get maximum 
generalizability to various action outcomes. As noted already, the 
movement toward defining behavioral objectives in occupational testing 
can lead to great specificity and huge inventories of small skills that have 
little general predictive power. One way to get around this problem is 
to focus on defining thought codes because, almost by definition, they 
have a wider range of applicability to a variety of action possibilities. 
That is, they represent a higher order of behavioral abstraction than 
any given act itself which has not the capacity to stand for other acts the 
way a word does. And in empirical fact this is the way it has worked out. 
The n Achievement score—an operant thought measure—has many 
action correlates, from goal setting and occupational styles to color and 
time-span preferences (McClelland, 1961), which individually have little 
power as “actones” to predict each other. A more recent example is pro- 
vided by an operant thought measure of power motivation which has very 
low positive correlations with four action characteristics: drinking, gam- 
bling, accumulating prestige supplies, and confessing to having many 
aggressive impulses that are not acted on (McClelland, Davis, Kalin, & 
Wanner, 1972). These action characteristics are completely unrelated to 
each other so that they would be unlikely to come out on the same dimen- 
sion in a factor analysis. But what is particularly interesting is that they 
appear to be alternative outlets for the power drive because the power 
motivation score correlates much higher with the maximum expression 
of any one of these alternatives than it does with any one alone or with 
the sum of standard scores on all of them. The thought characteristic— 
here the desire to have impact, to make a big splash—is the higher order 
abstraction that gives the test predictive power for alternative ways of 
making a big splash in action—by gambling, drinking, etc. The tester 
of the future is likely to get further in finding generalizable competencies 
of characteristics across life outcomes if he starts by focusing on thought 
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patterns rather than by trying to infer what thoughts must lie behind the 
clusters of action that come out in various factors in the traditional trait 
analysis. 

However, I have been arguing for this approach for over 20 years, and 
as far as I can see, the testing movement has been affected little by my 
eloquence. Why? There are lots of reasons: People keep insisting that 
the n Achievement score is invalid because it will not predict grades in 
school—which is ironic since it was designed precisely to predict life 
outcomes and not grades in school. Or they argue it does not predict all 
types of achievement (Klinger, 1966)—when, of course, it is not supposed 
to, on theoretical grounds. But the practical problem (outside the tedium 
of content coding) is the unreliability of operant thought measures. Many 
of them are unreliable, though not all. Costa’s (1971b) ego development 
score has a test-retest stability coefficient over a year of .66, N = 223. 
Unreliability is a fatal defect if the goal of testing is to select people, let 
us say, with high n Achievement. For rejected applicants could argue 
that they had been excluded improperly or that they might have high 
scores the next time they took the test, and the psychologist would have 
no good defense. One could just imagine beleaguered psychologists trying 
to defend themselves against irate parents whose children had not gotten 
into a preferred college because their n Achievement scores were too low. 

But the emphasis in the new testing movement should be as much on 
evaluating educational progress as it is on identifying fixed character- 
istics for selection purposes. The operant thought measures are certainly 
reliable enough for the former objective. The educator can use them to 
assess whether a certain class or an innovative approach to teaching has 
tended, on the average, to promote ego development in thought as as- 
sessed by Costa’s measure. The educator does not care which particular 
child is high in the measure since he does not plan to use the measure to 
select the child for special treatment. So its unreliability does not matter. 
He, as an administrator, can use the test information to decide whether 
the goals of the school are being forwarded by one educational approach 
or another. In a sense, the very unreliability of the thought measures 
may be a virtue if they encourage educators to stop thinking only about 
selection and start thinking more about evaluating educational progress. 

Does this mean that test reliability is always unimportant? Not at all. 
Sometimes it will be important to diagnose deficiencies reliably that are 
to be made up. On other occasions, tests will have to be used to pick out 
Sa most р to be able to do a particular job well. So something will 

0 be done about reliability. Thus a man with a high n Achievement 
Score is a better bet for a sales job than a man with a low n Achievement 
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oe es Is not very defensible for selection purposes because it is un- 
reliable. In this instance, the thought cod 
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on thought sampling, it can be used аз a substitute for the latter in selec- 
tion. In fact, the thought codes can be considered devices for finding the 
clusters of action patterns that can be measured more reliably to get 
indexes of various competency domains central to various life outcomes. 
For example, if it turns out that an elevated socialized power (s Power) 
score (McClelland et al., 1972) characterizes successful policemen more 
than unsuccessful ones—as would be expected—then the action corre- 
lates of socialized power, such as capacity to lead or be influential in 
social groups, can be used to select potentially good policemen. The s 
Power score itself could not be so used because it is unreliable and fake- 
able if you learn the scoring system, but it is essential as a validating 
criterion for more reliable measures because its wide network of empiri- 
cal and theoretical relationships helps find the action characteristics 
that will be useful for selection purposes. 

While the six principles just enumerated for the new testing movement 
may affect occupational testing, the fact remains that testing has had its 
greatest impact in the schools and currently is doing the worst damage 
in that area by falsely leading people to believe that doing well in school 
means that people are more competent and therefore more likely to do 
well in life because of some real ability factor. Coneretely, what would 
an organization like the Educational Testing Service do differently if it 
were to take these six principles into account? As a start, it might have 
to drop the term "intelligence" from its vocabulary and speak of scholastic 
achievement tests that are more or less content specific. The non-content- 
specific achievements (formerly called aptitudes) do predict test-taking 
and symbol manipulation competencies, and these competencies are 
central to certain life-outcome criteria—like making up tests for others 
to pass or being proficient as a clerk (Ghiselli, 1966). But it is a serious 
practical and theoretical error to label them general intelligence, on the 
basis of evidence now available. 

Once the innate intelligence philosophy is discarded, it becomes ap- 
parent that the role of such a testing service is to report to schools a 
profile of scholastic and nonscholastic achievements in a number of dif- 
ferent areas. Then in the case of selection it is for the college to decide 
whether it has the educational programs that will promote growth in 
given areas of low performance. If performance is already high, say in 
mathematics, then the college probably can produce little improvement 
in that area and should ask itself in what other areas it can educate such 
a student, as shown by his lower levels of accomplishment at the outset. 
The profile particularly should include measures of such general charac- 
teristics as ego development or moral development (Kohlberg & Turiel, 
1971) based on thought samples, because these general competencies 
ought to be improved by higher educational systems anyway. 

The profile of achievements should be reported not only at entrance 
but at various points throughout the schooling to give teachers, admin- 
istrators, and students feedback on whether growth in desired char- 
acteristics actually is occurring. Test results then become a device for 
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i 65 ri 1 teaching-learning process 
helping students and teachers redesign the | : 5 
to obtain mutually agreed-on objectives. Only then will educational test: 


ing turn from the sentencing procedure it now is into the genuine service 
it purports to be. 
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2. PROCESSING VISUAL 
INFORMATION 


An Example from the 
study of Apparent Motion 
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Unlike intelligence testing, the subject treated in the Ош en t 
articles has not stirred up any great debates outside psye p d d of 
within the field of experimental psychology, particularly in the cpm 
human visual perception, there is considerable interest in how er А 
ceive motion. We know, of course, not only that we can perceive етой, 
tion but also that a compelling illusion of motion can be readily achi still 
an illusion that has led to the development of movies. But there ps в. 
Some uncertainty about the stimulus features that are important tides 
ating the phenomenon of apparent motion. The following two pec 
take different sides on the question of whether the m of the s 
is of any significance in the perception of apparent motion. 

The es dedos are ое n other ways as well. They eat 
two different generations of research—Jesse Orlansky exempli pas 
the older, BC (before computers) researcher, and David Navon, oe 
generation of experimental psychologists skilled in the use of bot 
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lus presentation. 


The results of these 
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EXPERIMENT 3: THE LIMITING EFFECT 
OF DIFFERENT FORMS ON THE PERCEPTION 
OF APPARENT MOVEMENT 


Jesse Orlansky, Institute for Defense Analysis, 
Washington, D.C. 


The problem of this experiment was to determine whether extreme 
difference in form of the stimulus figures would invalidate conditions 
otherwise normal for the perception of movement. An implicit problem 
was whether conditions making for the complete absence of figure- 
similarity could be achieved. On the basis of information reported earlier 
in this investigation, one would suspect that extreme difference in form 
should tend to reduce the range and, if all figure-similarities were en- 
tirely abolished, to eliminate movement. 

About 1910, in Berlin, Linke commercially produced a toy stroboscope 
by means of which a triangle could be seen transformed into a man. In 
his experiments (8, 9) he finds that one form may change, as may one 
color, into another. Linke reported that movement occurred. Our concern 
is an investigation of the conditions under which movement may or may 
not appear. 

The stimulus pairs were figures chosen to resemble each other as little 
as possible. These pairs were composed of simple figures as in the previ- 
ous experiments. No brief is held that an acme of difference was achieved; 
the contemplation of what such extreme differences would be, reduced 
the experimenter to vacuous thoughts. It is held, without fear of contra- 
diction, that they were “pretty different.” 

Five series of figure combinations, which appear in an accompanying 
illustration, were eventually chosen. They comprise Series III-VII. 

While the figures in Series VII were chosen primarily because they 
are different, they pose an interesting sub-problem, revolving mainly 
about the fact that the area of the four-squares figure in combination A 
is 2.40 sq. in.; the figures in all other series total the usual 8 sq. in. The 
result in Series VII makes the super-position of both figures in the A 
pair (unequal in area) form a perfect square, as shown. The super-position 
of the equal-area figures of combination B would form an irregular figure, 
also shown. Actually, most subjects never noticed the difference in area 
between A and B. 


PROCEDURE 
Surely disastrous for the morale of an experiment would be to invite 
a subject to perceive the motion of extravagant figure combinations for 


Orlansky, Jesse. Excerpt from The effect of similarity and difference in form on apparent 
visual movement. Archives of Psychology, 1940, 246, 1-85. Reprinted with permission. 


PROCESSING VISUAL INFORMATION 
46 


z ее ep 
à period of an hour. Particularly if movement failed to be per eris pee ~ 
ine conditions, it would never be known whether these poan a 
have seen movement with ordinary figures. A resolution n д жы», 
lay in presenting these series separately; each of them cou 
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within 10 minutes, . . - We know, therefore, that these subjects E 
perceived apparent motion. And from Gilbert's study (4) and e were 
observations we know that Whatever slight practice effects ore вава 
over the period of an hour generally act to increase rather than de 
the range of movement. 


| in used. 
The apparatus which has already been described was pir 
[Apparatus from earlier section of this article is shown in box. 


APPARATUS 


а. The Exposure Apparatus 


A rectangular wooden 
dimensions: length 26 in., 
А vertical partition di 


box was constructed, with these ея 
height 18 in., and width 10 in. (see eon 
vided the interior into equal-sized, lig 
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Neon lamps 


Milk glass 


Runway for 
stimulus cards 


Ground glass 


TOP VIEW 
FIGURE 1: The apparatus. 


proof compartments. The front of this box was cut away and a 
ground glass screen, 10 by 20 in., inserted so that each compart- 
ment could be used to illuminate its half of the ground glass. A slot 
and guide were built into the right and left sides of the box. An 
Opaque cardboard inserted into either guide cut off the light (of the 
neon bulbs in each compartment) from reaching the ground glass. 
Therefore, any figure cut out of the cardboard could be seen on the 
ground glass only when illuminated from behind. There being two 
Such compartments, independent control of their lights could be 
used to present first one figure and then another. 

| Neon bulbs were used for illumination because their lag in strik- 
Ing and extinction is practically negligible. Each compartment had а 
bank of five neon bulbs. These are standard 3-watt, 120-volt G.E. 
lamps; their light is an orange glow. In our experiment, the lamps 
were operated by direct current. In each compartment, a sheet 
of flash opal glass was inserted to diffuse the light evenly before 
1 reached the ground glass. 


b. The Timer 

The apparatus which produced the alternation of the exposed 
figures was the same as that used by Gilbert in another experiment 
and a complete description may be found in his published paper 
(10). In brief, the time-interval between the figures presented de- 
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-time) was 
blank period between presentations (time- 
interval), 


The procedure was the same as that followed in the previous experi- 
ments to control the factors of area, distance, Position, order of appear- 
ance, duration, and illumination, 


SUBJECTS 


. The subjects in this experiment were the same individuals who served 
in the other parts of this Study. There were 16 subjects for each of the 
five series of this experiment, none of whom appear here twice, making 
for a combined total of 80 different individuals so examined, 

RESULTS 


The Ranges "T Table 1 
їп Experiment з. pavement for the Combinations in Series III. il 
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47 there are two at about 75; including all ranges, the mean falls at 45.8, 
and if the highest two were excluded, the mean would be 37.9 

These should be compared to the ranges found in Experiment 1, where 
the ranges of the 25 combinations ran from 99.9 to 211.1. The mean of 
these values, which includes all the identical and nonidentical combina- 
tions, equals 146.6. The mean of the 5 identical combinations alone equals 
180.3. In Experiment 2 the ranges ran from 100.6 to 158.2, averaging 
129.4; the mean of the 4 highest (the identical pairs) equals 146.2. Re- 
membering that differences as low as 28 points between the ranges of 
the second experiment were statistically reliable, there can be no ques- 
tion that the reduction from these ranges to the ones of Experiment 3 
is enormous. 

Fisher’s test (3) for the significance of small samples was applied to 
these data. This was done only for those combinations seen by the same 
subjects, which indicates the intraseries comparisons. Within each series, 
Fisher's t never reaches here above 1.01, and it would have to be at least 
2.13 to be significant. This means that in every case the two pairs of 
each series are low and do not significantly differ from each other. It 
may be noticed that the ranges of Series V are the highest of all the non- 
sense patterns. The critical ratio between the ranges of figures III A and 
V A (80.2 and 76.0) indicates that this difference is reliable, being equal 
to 4.35. This is probably due to the fact that the small and large circles 
in Series V are actually similar figures, thus differing from the figures 
comprising Series III. We will return to this combination in a later 


experiment. 


b. Analysis of the Types of Movement Effects 
teresting types of movements in the whole 
ith these dissimilar patterns. This material 


is too bulky and unwieldy to be completely presented as a table in the 
paper. A summary of this material appears in Table 2, and there will be, 
in addition, a general discussion of its meaning. — mM 

When movement is easily observed, there is little ambiguity in the 
subject’s report, and generally the movements can be categorized under 
a few headings. The inability to perceive movement is reported on only 
а few trials... . | 

Quite the reverse is the situation in the present experiment, where 


the figure combinations make it difficult to perceive movement. An at- 
tempt to elucidate this condition is the basis of Table 2. When movement 
parently organizes the stimuli 


is difficult of perception, the subject ap 
Into many different phenomenal appearances. The result is that many 
different kinds of movements are observed, as shown in the last column 
of the table. As the number of categories necessary to classify the move- 
ment effects increases, each category must necessarily occupy a smaller 
percent of the total occurrence. Of interest, therefore, is the percent of 
occurrence for that type of movement most frequently observed with 


In some ways the most in 
experiment were observed w 
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Table 2 
Summary af the Analysis of Types of Movements Observed 
in Series III-VII, Experiment 3 


PERCENT OF 
TRIALS IN PERCENT OF NUMBER OF 
SERIES WHICH MOST TRIALS IN WHICH DIFFERENT 
COMBINATION COMMON TYPE NO MOVEMENT TYPES OF 
OF MOVEMENT WAS OBSERVED MOVEMENT 
OCCURRED 


OBSERVED 
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56.7% of the trials. There was certainly a marked inability to perceive 
movement in this experiment. 

Almost every conceivable type of movement effect and a good number 
that were incredible until they were actually reported may be found in 
this experiment. Fusion effects, besides those previously recorded, 
were experienced in plane, third-dimensional, and angular varieties. 
Expansion-contraction was a general category of high frequency within 
which oceurred many novel transformations of one figure into another 
and back again; these were also complicated by variability in the plane 
of the movements. 

Figures were reported as independent objects circling each other in 
Space, almost as binary stars. A phenomenon similar to this presented a 
revolving cube on the four faces of which the two exposed figures alter- 
nated with two blank spaces. In the cases of stimuli involving an arrow 
or a half-dumbbell a frequent report was that this figure crashed through 
or was shot out of the other. Series III bore reports that the arrowhead 
actually appeared conical, so when this figure pivoted 90 degrees in the 
third dimension, the subject saw only the largest cireumference of the 
cone, a circle whieh was the other figure. Had the actual dimensions of 
the widest part of the arrowhead (the base of the assumed cone) and 
the diameter of the circle been respected, this would have necessitated 
a report that the arrowhead expanded to approximate the size of the 
circle. But this was overlooked, leading one to suspect that the real 
diameters were adjusted to apparent equality. 

A noteworthy effect observed in all cases, though most common in 
Series VI, was the report that the figures became blurred and fuzzy. 
This was particularly the case when subjects reported that a combination 
was rarely conducive to movement. The figures then appeared to lose 
their contour characteristics, and the subject would report movement 
that changed one indistinct figure into another indistinct figure, this 
change taking place through a nebulous haze. From the point of view of 
the functional relation between such forms, the indistinct figures actually 
looked much more like each other than they did with sharp contours. The 
possibility that a change of accommodation may have contributed to this 
effect will be considered in the discussion. 


C. Subjective Estimates of the Relative Ease for the Perception 
of Apparent Movement 


Subjects were asked to designate which of the two combinations were 
observed to move more readily. Since there were different groups of 
Subjects for each of the five series of this experiment, every subject 
was prepared to estimate only those he observed; thus, inter-series com- 
parisons eannot be made. Within each series, however, the pattern is 
perfectly obvious. And this is practically a chance choice, the meaning of 
which must be that subjects could not consistently agree that some com- 
binations moved more readily than others. As a matter of fact, there was 
probably мету little choice Троса recente тах he found in Tahle 9 Tn 
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Table 3 
Averages of the Subjects’ Estimates, by Ranking, of the Ease with 
Which the Combinations of Series III VII Appeared to Move 


SERIES COMBINATION AVERAGE RANK 
ш А 15 
B 1.8 
IV A 1.4 
B 1.6 
y A 1.3 
B 17 
МЕ А 16 
B 14 
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different types of movement. The same influence makes even the most 
prominent type of movement appear infrequently as compared with other 
combinations. 

The key to the whole situation may be the interesting fact that the 
figure combinations for which it was most difficult to perceive motion 
lost their sharp outlines and became blurred. The result of this blurring 
was to make the indistinct masses more like each other, the condition 
under which movement was perceived. This blurring effect might not 
be the case with greater exposition times. When there was no movement, 
blurring was not often reported. 

It would appear that there is a strong tendency to make the figures 
appear equal in movement. When the forms differ, most such similarities 
are ruled out. Then fusion may appear. Beyond that, certain differences 
are overlooked in the setting up of apparent working constancies for 
which there is no complete basis in fact, e.g., the cases of the conical 
arrowhead (Series III) and the four-squares figure (Series VII B). Should 
the actual differences be too awkward to be overlooked in this manner, 
certain releases from the immediate contexts were reported. Here, 
reference is made to the appearance of the two figures independently 
circling each other in space and the invention of a revolving figure on 
whose four faces the two figures alternated with the blank spaces. These 
effects would, of course, be complicated by the lack of stability of any 
particular movement, so that the subject would report two or three types 
of movement on a single trial, the patternization of which is very likely 
influenced by the time-interval. And then a final resolution of different 
figures into a movement relationship might require the actual oblitera- 
tion of the differences (never complete) through appearances which by 
their very indistinctness made possible some similarities. When no simi- 
larity of form exists, the effects which occur seem to disclose fictitious 
similarity. From an analysis of the reported phenomena, subjects at- 
tempt to set up some fictitious similarity. | 

A change in visual accommodation may have produced the blurring 
of figural characteristics which accompany perceived movement under 
those conditions. The experiment was conducted in the dark, and there 
Was neither the opportunity nor apparatus to determine the involvement 
of convergence or accommodation in the phenomena. The suggestion of 
an accommodation effect, therefore, is speculative and inferential. Should 
it prove that accommodation is involved, it implies that a stimulus char- 
acteristic (the figure received is a blurred one) and not a perceptual ren- 
dering (the blurring of a clear figure) would explain the phenomenon. 
A complete explanation is not to be found in the fact that accommodation 
is implicated. Thus remaining unaccounted [for] would be the fact that 
accommodation was imperfect only on trials for which blurring oceurred— 
and that it did not ordinarily change for similar figures. The present 
explanation poses no antagonism between the alternative hypotheses, 
and the observation that marked form-differences are accompanied by 
blurring (however produced) may remain uncontested. 
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presentation. And to this must be added the import of the present dis- 
cussion. The real differences tend to be disregarded or actively reduced, 
and novel organizations may resolve them by creating new figures. 


SUMMARY 


This experiment set the problem of studying the possible breakdown 
of apparent movement by combinations of markedly dissimilar figures. 
The range of movement was considerably reduced by such combinations, 
but movement was not completely eliminated. Many different types of 
movements were observed and analyzed, and there were many trials on 
which movements could not be perceived. Subjects were not able to judge 
that some combinations rendered movement more easily than others. 
An explanation was offered for the occurrence of the residual movement 
effects which held the apprehension of apparent movement to be a per- 
ceptual process, actually a constancy effect, whereby objective differ- 
ences tend to be disregarded or actively reduced. 
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retaining a location and building a new figure within it, the visual system 
performs the latter task” (Kolers, 1972, p. 54). For example, when he 
showed (Figure 46, Array 5) in succession the stimuli displayed here in 
A and B of Figure 1, no motion was perceived; rather, the extreme fig- 
ures changed shape, and the innermost ones blinked on and off. Kolers’ 
demonstrations indicate that the visual system exhibits location inertia 
rather than preservation of figural identity. To complete this line of 
argumentation, one should show that preservation of identity does not 
affect the perception of motion even when it does not conflict with main- 
tenance of locations. 

For this purpose I designed several situations of ambiguous apparent 
motion, each with several interpretations. Each interpretation involves 
the same degree of change of locations. One interpretation of each situa- 
tion satisfies the principle of preservation of figural identity. The others 
do not. If figural identity has any effect at all on the formation of motion 
impression, then the interpretation in which figural identity is preserved 
should be the one most likely to be perceived. A failure of figural identity 
to bias perception would be another indication of its irrelevance in de- 
termining perceived motion. 


METHOD 


Stimuli 


Each different situation of apparent motion can be called an episode. 
An episode is characterized by the appearance of a cycle of two or six 
consecutive different displays, and it consists of a number of iterations 
of this cycle. Each display was presented for 150 msec with а 30-msec 
time interval between any two consecutive displays as well as between 
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FIGURE 1. The displays alternated by Kolers (adapted from Kolers, 1972, Figure 46, 
Array 5). 


E 
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Subjects and Response Mode 


In contrast with the convention in apparent-motion studies to present 
few experienced subjects with many repetitions of the same stimuli 
(possibly with a change of some of the display parameters), I used naive 
subjects and presented them with an episode only once. My consideration 
was that if a visual ambiguity is really ambiguous, that is to say, the 
different interpretations are perceptually possible and likely, then once 
one is familiar with all the interpretations, he may be inclined to impose 
one of them on the visual field according to considerations which are not 
perceptual. For example, he may try to force himself to see the identity- 
preserving motion, because it reconciles best his percept with his con- 
jecture about the process of generating the stimuli. Therefore, the 
Subjects saw an episode only until their first interpretation was super- 
Seded by a second one, if at all. They reported their first interpretation 
(and the second one if there was any). The approximate duration for 
Which the first interpretation persisted was recorded in a way described 
їп the Procedure section. 

Subjects were 26 nonpaid undergraduates at the University of Cali- 
fornia, San Diego, who participated in the experiment as part of their 
course requirement in the department of psychology. Eight subjects 
were unable to experience ordinary beta horizontal motion in the pre- 
liminary stage of the experiment, so although they did participate in 
the rest of the experiment (to avoid the need for a cover story), their 
data were not included in the analysis. No subjects were familiar with 


the Hebrew alphabet. 


Design 

The episodes were classified into three groups: (a) Episodes 1, 2, 3; 
(b) Episodes 4, 5, 6; and (c) Episodes 7, 8, 9 (see Figure 2). Episodes of 
Опе group were presented consecutively; thus the sequence of trials may 
be conceived as forming blocks of three trials each. Groups and vocabu- 
laries were paired and assigned to blocks according to a 3x3 Graeco- 

айп square. The episodes were ordered within blocks according to 
three orthogonal Latin squares, one for each block. Each subject was 
assigned to one sequence of the Graeco-Latin square which dominated 
the three orthogonal Latin squares, and in addition to one of the arrange- 


ment conditions. 


Apparatus and Setting 

The main equipment consisted of a display Tektronix oscilloscope 
(which decayed completely in .9 msec at the luminance level used) and 
a typewriter keyboard, both controlled by a PDP-9 computer in an acous- 
tically isolated booth. The subject sat in the booth in front of a table. 
The keyboard was positioned on the table at about elbow level directly 
In front of the subject. The display oscilloscope was positioned in front 
9f the subject at about eye level. Viewing distance was about 45 em. 
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The length of an element was about 4 mm, and the diameter of the imagi- 
nary circle circumscribing the elements in Episodes 4—9 was about 60 
mm. Thus the longer side of an element subtended about 0°80’ visual 
angle for the subject, and the whole relevant visual field subtended about 
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the key, the display was terminated after 10.7 sec. The time the subject 
took before depressing the key, if at all, was measured. After the termi- 
nation of the display, the subject described verbally his interpretation(s) 
of the episode, and in case of difficulties in verbal communication the 
subject could use the Erector Set pieces to physically demonstrate his 
experience. 

As said earlier, exposure time was about ? 150 msec, and ISI (inter- 
stimulus interval: interarray and intercycle intervals) was about 30 msec. 
Those parameters were selected on the basis of both former findings 
(Kolers, 1964) and some informal studies with myself and several other 
colleagues as subjects. 


RESULTS AND DISCUSSION 


The arrangement condition (i.e., Ferris wheel versus roulette) did 
not seem to affect the results at all, so data are pooled across conditions. 
The type of vocabulary interacts with the results only in one case, so 
data are pooled across vocabularies except for that single case. . 

The crossing slides episode. Seventeen subjects reported a vertical 
motion in at least one of their interpretations. No subject reported a 
crossing diagonal motion in either of the two interpretations. One subject 
reported a seesaw motion. The result is highly significant, x? (2) = 26.86, 
P < .001. Thus on the basis of both the present result and Kolers' similar 
finding (Kolers, 1972, pp. 76-79), it can be concluded that the failure to 
See crossing diagonal apparent motion is a pervasive phenomenon that 
18 probably not affected at all by preservation of identity. Although in- 
teresting, this is somewhat weak evidence for the unimportance of the 
Principle of preservation of identity because it may be argued that effec- 
tive as this principle may be, it may still fail if it leads to a perceptual 
absurdity, such as the presence of two visual objects at the same location 
at a certain moment, which is implied by the crossing paths. The failure 
to see diagonal motion can also be attributed to the size difference be- 
tween the vertical and the diagonal paths. More direct evidence is pro- 
vided by the results of the other episodes. | 

he bd episodes. The preservation of figural identity should каше 
the ambiguity of Episode 2 in favor of the opposite horizontal motion o 
Seesaw motion about the vertical axis) and the ambiguity of ays 
1 favor of the opposite vertical motion (or seesaw motion ex t E 
orizontal axis). If the effect of identity is partial but Is stil p , 
Several possibilities exist. First, both motions will be seen alternat ely, 
but the first to appear or the more compelling will be the one preser ving 
the identity of the elements. Second, one type of motion Is a pr lori uri 
likely to be experienced; thus it will be the only interpretation in S 
Episode in which the motion preserves the identity of the elements. in 
the episode in which the other type of motion preserves the ү 
Owever, both motions will be experienced. Third, both motione ма Le 
*Xperienced in both episodes, and the first interpretation will be the 
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тл Да tilt was seen by the subjects in at least one of their two 
ра я Here there seems to be an effect to the vocabulary of 
hy ee E As can be seen in Table 1, the disparity between the dia- 
Eam Чч letters was not sufficient to override the tendency to see 
КЕ пер lon, whereas the diamond was different enough from the line 
d 2 to induce the tilting interpretation. (Fisher’s exact probability 
ath i other interpretations" is .004.) It thus appears that 
et Strict identity of shapes is unnecessary and even irrelevant for 
аи ng motion between successive stimuli, nevertheless some sort 
па arity between the stimuli has to be maintained. A more thorough 
jtm sion of the possible aspects of that similarity will be presented 
* Даны episodes. Here the figural identity of the elements 
Ык. xdi bya counterclockwise cireular motion of all the elements. An 
PI ie ive interpretation is that of the elements staying in the same 
ee but changing shapes. This is a clear instance of superiority of 
ti fe Inertia over preservation of figural identity. In still other inter- 
i б ions the perceived identity of one of the elements induces just a 
This ar motion of itself while the other elements are seen as stationary.® 
eom often happened in the five diamonds and one diamond versions, 
Where one element was clearly discernible from the other five. Any sub- 
Dis that saw any kind of circular motion in at least one of his interpreta- 
Pune of an episode is classified under the heading “Circular” in Table 
r subjects who did not see any motion at all are classified under the 
oe “Stationary.” As can be seen, the impact of shape preservation 
at almost does not exist in the different diamonds version is consider- 
rea effective in the five diamonds version and even more so in the one 
я bred version, Cochran's Q (3) = 20.46, p < .001. In view of this 
ing and of the result described in Table 1, it seems that the type of 
motion perceived is sometimes, although not often, affected by the output 
"i processes which identify the objects in the scene of an episode. So 
e observed impotence of the principle of preservation of figural identity 
probably cannot be explained by a process in which figural analysis neces- 
Sarily succeeds motion analysis. 
In order to gain better understanding o 
tween these two types of analysis, let us examine care 
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Table 1 
Frequency of Different Interpretations of the “One Diamond 
Cha-Cha-Cha” Episode According to Type of Vocabulary 
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Attneave (1974), Breitmeyer, Love, & Wepman (1974), and Kolers 
and Pomerantz (1971) discussed the relationship between ISI and the 
degree to which apparent motion is affected by shape. However, the 
Phenomena demonstrated in this article appear to be unaffected by 
changes in ISI. In informal observations made on several colleagues and 
myself with ISI values of 0, 150, 200, and 300 msec, the motion appeared 
less compelling but essentially unchanged. At an ISI of 400 msec, one 
can hardly see any continuous motion at all. 

Anstis (1970) proposed that in apparent motion “each point in the field 
at time t, is perceived as moving towards the nearest point that happens 
to have the same brightness at time t” (р. 1428). In an informal study 
I presented several colleagues with an episode similar to Episode 2, 
except that the figures were two diamonds, one of which was three times 
as intense as the other one. The favored motion was a vertical one over 
a wide range of intensity settings. Thus in this case, figural identity is 
not the only attribute which is not preserved; brightness is also not 
Preserved. 


Some Theoretical Implications 


Finally, some hypothesis about the processes involved in perceiving 
motion may explain most of the discussed findings. First, it should be 
noted that the exposure time of 150 msec used in the experiment was 
Certainly sufficient for figure identification, as evidenced both by the 
Pretests in which subjects by and large did not fail to recognize any of 
the probe letters and by the fact that subjects identified elements during 
the Presentation of the episodes. Thus figural analysis is performed, 
although it usually does not bear on the motion perceived. Any model 
of motion perception will have to take this into consideration. Kolers 
accounts for it by hypothesizing two functionally independent systems, 
Опе for pattern analysis and one for space-and-motion analysis, whose 
Outputs are coordinated by a higher order assessor (Kolers, 1972, pp. 
114-116), This assumption is unnecessary because a unified system can 
also yield the same phenomena. у : 

Assume that the formation of motion perception is a process of looking 
Or the best account for disparities in consecutive static displays. The 

€cision must be made very fast in order to maintain a continuous and 
Coherent percept, even at the cost of ignoring some information that is 
Slowly obtained or interpreted. To explain the precedence of location 
Information over figural identity, one should simply consider the kind 


of info: i » hich determines the motion receives. Sup- 
rmation the process w| to perform than 


Pose that the process of determining shape takes longer 

© process e determining location (e.g., as suggested by eos 
Fehrer & Raab, 1962, and Thorson, Lange, & Biederman-Thor son, 1969). 
Then the process which is analyzing motion would receive information 
about location or global configuration before it would receive information 
about shape. Figural information may arrive too late to be useful. That 
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properties of the global configuration in the array itself or the short-term history of changes 
of locations from array to array. 

4. Five subjects (four of them in the roulette condition) reported that the circle seemed 
to expand before switching directions of movement. 

5. Actually some subjects reported seeing an element going around a stationary circle 
made out of other elements, which entail that it occludes different parts of the circle while 
moving. This observation is inconsistent with Kolers’ claim (Kolers, 1972, p. 81) that illusory 
motion, which allows for two objects to occupy the same place at the same time, is impossible. 
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edel component in psychotherapy is the use of corrective feed- 
Steerer = been especially evident during the past 15 years, with the 
therapy е of many new approaches to treatment, including behavior 
been ei re and encounter groups, and what have sometimes 
а ed “video” therapies. In all of these approaches, the patient 
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tones for specific electroencephalogram changes. It was reasoned that 
through the feedback and resulting discrimination, subjects would be 
able to develop voluntary control over these functions. 

It is important to consider the history and scope of the issues raised 
by this research in order to more fully appreciate its implications and 
applications to clinical treatment. In particular, biofeedback has stimu- 
lated renewed interest in concepts such as voluntary control and con- 
sciousness and in the central question of the degree to which man can 
modify and control his behavior in the face of opposing forces in the 
environment. Consequently, the role of the individual patient in taking 
responsibility for the maintenance and control of his own health becomes 
of paramount importance. Although biofeedback is a form of behavior 
therapy, this does not alter the basic issues involved, especially since 
questions of self-control and cognition are becoming central issues in 
current behavioral approaches to treatment. 

Over 250 papers have been published on the use of operant-feedback 
techniques in the control of physiological processes, and four volumes 
reprinting much of this work are now available (Barber et al., 1971; 
Kamiya et al., 1971; Stoyva et al., 1972: Shapiro et al., 1978). My wish is 
not to review this work nor to cover all the potential clinical applications 
to date, since this material has been recently reviewed elsewhere (Shapiro 
& Schwartz, 1972). Rather, this article considers some of the important 
issues or problems involved in clinically applying biofeedback techniques 
to the individual patient. Toward this end I will discuss some of our basic 
and clinical research on the voluntary control of human cardiovascular 
responses, for it illustrates many of the problems involved. 


BASIC RESEARCH 


The possible application of biofeedback to the control of essential 
hypertension (high blood pressure of unknown etiology) first requires 
the development of a suitable blood pressure measurement and feedback 
system. A prerequisite for any biofeedback application is the accurate 
sensing and processing of physiological activity, and not surprisingly 
one of the reasons why the field took so long in developing was that readily 
available biomedical technology, including accurate programming equip- 
ment, has only appeared in the past 15 years. 

Actually, the technology for sensing and recording human blood pres- 
Sure on a continuous basis was already available, but it was not practical 
for routine laboratory work since it required surgical insertion of a 
pressure-transducing tube into an artery. On the other hand, while the 
traditional stethoscope and blood pressure cuff procedure used by the 
physician was neither aversive nor dangerous, it was too intermittent 
(one could only obtain about two readings per minute) and too inaccu- 
rate to make it satisfactory as a feedback device. However using new 
pressure-regulating devices and solid-state circuitry (coupled with an 
extension of blood pressure measurement theory), it was possible to 
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modify and automate a constant cuff pressure system to accurately mea- 
sure median blood pressure (+2 millimeters of mercury) and at the same 
time provide the subject with binary (yes-no) feedback at each heart 
beat for increases or decreases in blood pressure (Tursky, Shapiro, & 
Schwartz, 1972). The details of this system need not concern us here— 
the point is to highlight the general problem of being able to detect and 
measure the function in question so that it can be easily and practically 
used by the patient. 

In our Базе research (as well as clinical) studies to date, we have 
typically used the procedure of giving the subject binary feedback for 
small, desired changes in blood pressure (or heart rate) at each beat of 
the heart and, after producing a certain number of correct responses, 
of giving the subject a reward or incentive in the form of a slide.? The 
slides might be landscapes of Boston, pictures from around the world, 
photos of nude women, or numbers reflecting monetary bonuses. As will 
be seen, in multisession work involving patients, maintaining interest 
and motivation can become an important problem, one not easily solved 
by using simple extrinsic rewards. 

In the first two studies (Shapiro, Tursky, Gershon, & Stern, 1969; 
Shapiro, Tursky, & Schwartz, 1970a) the basic question was whether 
healthy college students could learn to raise or lower their systolic blood 
pressure in a single session. All subjects received the same general in- 
structions, and only the contingency of the feedback and reward was 
varied between groups. To control for possible instructional effects, 
subjects were not told what the bodily function was, nor were they told 
in what direction the function was to change. The results indicated that 
subjects reinforced for increases in blood pressure were able to raise or 
maintain their pressure, while subjects reinforced for decreases in sys- 
tolic pressure were able to lower their pressure over the session. Another 
group given random reinforcement showed that its average pressure fell 
about midway between the increase and decrease groups, suggesting 
that the two contingent groups controlled their pressure above and 
beyond the nonspecific effects of the instructions, the stimulation af- 
forded by the feedback and reward, and quiet sitting in a lounge chair 
for 40 minutes (Crider, Schwartz, & Shnidman, 1969). Interestingly, 
postexperimental questionnaires revealed that subjects were not aware 
they had actually controlled their blood pressure, nor did they realize 
the direction their pressure had changed. 

Some questions consistently raised by this research were, What were 
the subjects doing or thinking to control their blood pressures? Were 
they getting excited or relaxed? Were they thinking specific thoughts 
or possibly tensing their muscles in some way? Hence, questions of cog- 
nitive or somatic mediation were considered to be important, especially 
by those researchers interested in the theoretical question of whether 
operant conditioning of autonomic activity per se had in fact taken place 
(Katkin & Murray, 1968). As it turned out, the blood pressure data proved 
to be convincing in this regard, because when heart rate (a related cardio- 
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vascular function) was analyzed, no differences were found between 
groups. In other words, the subjects had not learned simply to raise or 
lower overall autonomic arousal, even within the cardiovascular system. 
In a later study (Shapiro, Tursky, & Schwartz, 1970b), when subjects 
were trained to raise and lower their heart rate using a similar binary 
feedback and reward technique, the subjects now learned to control their 
heart rate, without similarly affecting their systolic blood pressure. 
Altogether, findings of specificity of learning such as these were seen 
as important prerequisites for the application of biofeedback training 
to clinical problems (Shapiro & Schwartz, 1972), since alleviation of the 
symptom was desired without corresponding side effects such as those 
sometimes produced by drugs. 

However, questions of mediation and specificity are more complex 
than originally envisioned (Schwartz, 1972) and have important implica- 
tions for therapy (Schwartz, in press). On the one hand, the therapist 
is not usually concerned with obtaining a pure effect from a scientific 
point of view, but wants any combination of factors that will produce 
large, long-lasting, generalizable, and safe effects. Thus, if for a given 
hypertensive patient it was found that training in general muscle relaxa- 
tion alone produced the most rapid and massive decreases in pressure, 
this would be clinically valuable. Cognitive and somatic mediation, if 
useful, is of significance to the therapist. 

On the other hand, the therapist may be concerned with what else 
is changing when a person is given biofeedback for a particular function. 
In fact, there are instances where the therapist may be interested in 
patterns of responses. For example, the desired goal for those hyper- 
tensive patients having normal heart rates may be to lower stroke vol- 
ee and/or peripheral resistance rather than to change heart rate per se. 
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and vice versa, then if feedback and reward were given for one, the other 
would simultaneously receive the opposite contingency. Both functions 
should also be learned, only now in opposite directions. However, since 
neither of these results was empirically obtained, it would follow that 
systolic blood pressure and heart rate are unrelated in such a way that 
binary feedback for one results in some form of simultaneous random 
reinforcement of the other. 

Assuming for the moment that this conclusion is true, the next ques- 
tion that arises is, How could a person be taught to control both of them? 
One approach that follows from the above analysis is to give the feedback 
and reward only when the desired pattern of responses occurs. In theory, 
it should be possible to teach a person to voluntarily integrate his systolic 
blood pressure and heart rate (make both functions increase or decrease 
together), or to differentiate them (make them go in opposite directions) 
by providing feedback and reward for the desired pattern. In subsequent 
studies, it has been verified that systolic blood pressure and heart rate 
are only integrated 50% under resting conditions (they spontaneously 
change from beat to beat in the same direction only half of the time) and 
that subjects can learn voluntarily to make them go together (Schwartz, 
1972; Schwartz, Shapiro, & Tursky, 1971) or apart (Schwartz, 1972) when 
given feedback for the appropriate pattern. 

Thus the interrelationships among bodily functions over time, as well 
as the exact nature of the contingency (feedback) involved, control the 
degree to which learned specificity or patterning occurs. Although this 
type of formulation may provide a useful first step in the prediction and 
control of multifunctions using biofeedback, it must be incomplete, for 
it treats physiology as if it operated in a vacuum, without constraints. 
However, physiological (and associated cognitive) constraints do exist 
and must be taken into account (Schwartz, 1972). The extent to which 
simple predictions from learning or feedback theory fail can reveal the 
exact nature of the constraints involved. 

For example, it is easier for subjects to learn to make their systolic 
blood pressure and heart rate go together than to make them go apart 
(Schwartz, 1972). Furthermore, feedback for integrating these functions 
produces somewhat larger effects and more rapid learning than feedback 
for either single function alone. In addition, when subjects are taught to 
lower both functions simultaneously, they now begin to report feelings 
of relaxation and calmness, states one would expect to be associated with 
more general autonomic relaxation. Altogether, it would seem that in the 
process of trying to understand and extend biofeedback techniques to 
patterns of responses, the research uncovers more about the nature of 
the physiological systems themselves and their relation to cognition. 3 

The combined behavioral-biological approach to biofeedback outlined 
here has a number of implications for treatment. One is that the clinician 
should carefully evaluate exactly how he administers feedback to an 
individual patient so that he is better able to precisely determine and 
control what the patient will and will not learn. Understandably, this 
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applies not only to simple discrete-digital feedback but to more complex 
forms of continuous-analog feedback as well. For example, if a patient is 
receiving analog feedback for beat by beat changes in heart rate, and he 
has sinus arrhythmia (heart rate rising and falling with each breath), 
the subject will be simultaneously receiving feedback for respiration, and 
he may modify it accordingly. The clinician should be aware of this possi- 
bility and should make a rational decision as to whether or not such learn- 
ing is desirable. | 

For applications involving the self-control of more complex states like 
anxiety or fear, it becomes important to determine what function or 
pattern of functions occurs in these states for the individual patient. 
It is well known that individuals differ in their autonomic specificity and 
that situations may evoke different patterns (Lacey, 1967); thus it makes 
good sense that the feedback be selected so as to optimize its integration 
(correlation) with the problem in question. 

As for the question of constraints, it follows that if it is possible to 
clinically evaluate the flexibility or variability of the function(s) in ques- 
tion, it may be possible to assess whether biofeedback will ultimately 
work for a given patient. For example, if a patient with fixed hyper- 
tension (possibly due to excessive hardening of the arteries) has blood 
pressure levels constantly elevated above normal, it seems unlikely that 
biofeedback will have any significant effect on purely physiological 
grounds. Evaluating the potential for organ change (e.g., by taking sleep 
measurements) may be a useful screening technique. It follows that early 
diagnosis of the individual for preventive feedback training before the 
organ system is seriously damaged may be the most meaningful applica- 
tion to psychosomatic medicine. 

Before considering further issues raised by clinical work, it may be 
useful to show how this model applies to animal biofeedback research, 
where curare is used to paralyze the muscles. Miller (1969) and DiCara 
(1970) have argued that by paralyzing the muscles with curare, the ques- 
tion of somatic mediation is eliminated. However, all the curare pro- 
cedure (using moderate doses) does is eliminate potential feedback from 
actual muscle contraction; it does not stop the rat from struggling and 
tensing his muscles “in his head." According to the model, to the extent 


move in his head," that is, to produce 
Ippocampus (in a nonparalyzed dog, 
S voluntary movements), heart rate 


ases as well. Recently, Goesling and 
Brener (1972) have found that teaching a rat to tense or 


h t in relax prior to 
curare greatly influences his later ability to raise or lower his heart rate 
while under curare. Hence, the question of mediation is c 


ү t omplex, espe- 
cially at the level of the brain, and one must evaluate (a) relations be- 


Biofeedback as Therapy: Some Theoretical and Practical Issues 7 


tween responses, (b) the nature of the reinforcement contingency, and 
(c) natural biological, environmental, and “state” constraints in order to 
understand and predict exactly what is being learned (Schwartz, in press). 


PRELIMINARY CLINICAL APPLICATIONS TO 
ESSENTIAL HYPERTENSION AND RAYNAUD'S DISEASE 


We have attempted to apply the general procedures developed in the 
basic research to patients diagnosed with essential hypertension (Benson, 
Shapiro, Tursky, & Schwartz, 1971). First, the seven patients partici- 
pated in from 5 to 16 control sessions, during which blood pressures were 
taken under resting conditions. Then systolic blood pressure feedback 
and reward sessions were initiated and continued until no further reduc- 
tions in blood pressure occurred for 5 consecutive sessions. The number 
of conditioning sessions varied from 8 to 34, with a mean of about 22 
sessions. The results showed that six of the seven patients decreased 
their pressure by as little as 3.5 millimeters of mercury and by as much 
as 33.8 millimeters of mercury (mean decrease for all patients was 16.5 
millimeters of mercury). Interestingly, the one patient who failed to show 
any decrease in blood pressure was later diagnosed to have renal artery 
stenosis, which relates back to the question of biological constraints. 

Although these findings are encouraging, they are a long way from 
demonstrating that biofeedback can be a clinical treatment for hyper- 
tension. First, it is not known whether patients can control their pressure 
without the feedback or even if the feedback was the significant factor 
in the research. Placebo variables may have been operating and can only 
be evaluated by the use of the appropriate control groups (Shapiro & 
Schwartz, 1972). On the other hand, as in all forms of therapy, positive 
expectations on the part of the patient may well be a prerequisite for 
positive gains, especially in behavioral therapies where patient coopera- 
tion and practice are necessary for change. 

Related is the general question of patient motivation. As described 
in the review of the clinical research (Shapiro & Schwartz, 1972): 


Central to all forms of psychotherapy is patient motivation and involvement. The 
question, at the simplest level, is how to motivate patients to spend the necessary 
time practicing the desired behavior. With severely retarded and/or emotionally 
disturbed children, reinforcers such as food, candy, or other desired objects (e.g., 
watching a fan spin) are often necessary for modifying the behavior. However, with 
less severe problems, therapist praise, praise from significant others in the patient's 
environment, or self-reinforcement (a sense of competence) may be more than suf- 
ficient. One predictor of success in psychotherapy is the extent to which the patient's 
problems are causing him pain or suffering. With regard to biofeedback, Miller has 
suggested that feedback, for controlling the function in and of itself may be all that 
is necessary. However, as progress is made, and the novelty of the situation wears 
off, what will keep the patient working at the task? In our experience with patients 
suffering from essential hypertension, a potpourri of rewards was successful, in- 
cluding money for participating and for succeeding (most of these patients were on 
welfare and thus the money they earned was quite significant), slides of scenes 
around the world, and general, non-specific praise from the physician and research 
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assistant running the sessions. However, this was an experiment, and the patients 
were receiving a sizeable monetary reward. One wonders whether they would have 
spent the same number of hours trying to control their blood pressure in the absence 
of such incentives, which are impractical on a large-scale basis. In this respect, 
learning to control physiologic processes is like any other self-control procedure 
(e.g., dieting) since sacrifice on the patient’s part may be necessary for the sake of 
health [p. 180]. 


Actually, the hypertensive patient may be a special case, more similar 
to the average cigarette smoker than to the headache patient. Although 
the pain caused by tension or migraine headaches is relatively immediate, 
as is its relief, it may be years before the potential harm from high blood 
pressure or smoking is experienced directly by the patient. Hence it is 
understandable why headache patients are willing to pay for experi- 
mental biofeedback treatment (T. Budzynski, personal communication, 
1971), while we must pay our research hypertensive patients to poten- 
tially help themselves. 

The question of motivation leads to another important problem: namely, 
whether the patient is able or willing to control his bodily functions in 
the hectic, rapidly changing environment outside of the laboratory. It is 
well known that environmental stressors can augment or even cause 
psychosomatic disorders, and it should not be surprising if many patients 
desire biofeedback for their blood pressure, just as they take medicine 
for their stress-related stomachaches. Simply put, the typical patient 
would rather change his body than change his life-style or his environ- 
ment, the two factors which together augment or cause the bodily dys- 
functions in the first place. In this context it would seem reasonable that 
biofeedback should be viewed as but one approach to the treatment of 
the total person, realizing that to cure a problem such as hypertension 

will require more than just the patient consciously attempting to lower 


his pressure. Similar views concerning behavioral therapies i 
have been stressed by Lazarus (1971). MS, 


: А good example of this 


other aspects of the patient's behavior (e 
cies of reward) and personality which are 

Depending on the particular function, 
volved, biofeedback trainin 
committed to spending th 
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him to travel to the laboratory for each session often becomes a formida- 
ble problem. For this reason the development of home feedback devices, 
coupled with weekly consultation with the physician and/or therapist, 
appears to be a fruitful direction in which to move. In addition, the use 
of portable feedback devices makes it possible for the patient to eval- 
uate his progress under different environmental conditions outside of 
the laboratory, and this may significantly enhance the potential for 
generalization. 

Whether or not the patient can learn to control these processes without 
feedback is still an open and important question. Procedures for weaning 
the patient from the feedback need to be worked out (Weiss & Engel, 
1971). However, patients may be able to learn certain cognitive or so- 
matic strategies which they then can use without feedback. One ex- 
ample of this comes from a patient we were experimentally treating for 
Raynaud’s disease, a problem of peripheral constriction and reduced 
blood flow that under extreme conditions can lead to gangrene in the 
hands and feet. Symptoms of Raynaud’s disease include a feeling of cold 
in the afflicted area that can be quite painful. The patient was a man in 
his early sixties with very cold feet; this patient was somewhat unusual 
in that he was a practicing psychoanalyst who came for behavioral treat- 
ment of his problem. Briefly, blood volume was recorded simultaneously 
from the big toes of each foot, and the patient was given binary feedback 
for each heart beat that was followed by a small increase in blood volume. 
The feedback and reward procedure was modeled after the blood pres- 
sure work, and similar slides were used. Although placebo and expec- 
tancy effects cannot be discounted (and were probably operating), over 
10 sessions he began to show large increases in blood volume, particularly 
in the left foot, the foot initially selected for feedback and reward. In the 
middle of a later session we then switched to providing feedback and 
reward for the right foot, which subsequently reached similar increases 
in overall blood volume. This finding of relative specificity was not sur- 
prising because it was observed on the polygraph that spontaneous dila- 
tions and constrictions frequently occurred in one foot and not the other, 
and therefore the feedback was really contingent to the particular foot 
controlling the programming equipment, while the other simultaneously 
received a form of random reinforcement. 

This particular patient was unusually inclined toward free association 
and interpreted the purpose of the slides as diverse material for him to 
covertly free associate to. In one of the later sessions the slide projector 
accidentally jammed and, rather than showing a slide, flashed just a 
white, bright light on the screen. This proved to be a fortunate accident, 
for the patient began free associating to the sun, warmth, beaches, and 
so forth, and he reported that these thoughts were particularly helpful 
in warming his feet. Furthermore, these “hot thoughts,” as the patient 
described them, were the images he used to control his feet outside of 
the laboratory. Interestingly, this procedure proved to be highly adap- 
tive for him since if his feet got cold, even while he was doing therapy 
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with patients, he reported that all he had to do was "turn on his hot 
thoughts" for a few moments and he felt relief. Whether or not he actu- 
ally was able to control his feet out of the laboratory is not known, nor 
is it known if these images were really the signifieant mechanisms by 
which he controlled his blood volume. Nonetheless, the case illustrates 
the potential use of cognitive mediation in the development of self-control 
of bodily processes. While this patient remained symptom free for about 
a year and a half, he later contacted us indicating that his symptoms had 
returned and that he desired further training. 

This patient was highly educated (and high up on the socioeconomic 
ladder), experienced and comfortable in laboratory situations, and greatly 
desirous and expectant of success. The other Raynaud's patient we have 
tried to work with so far was from a lower-middle-class background, 
inexperienced and wary of the laboratory setting, and of questionable 
motivation and expectancy concerning exactly what the feedback really 
was and what it was supposed to help her to do. This woman was seen 
for fewer than 10 sessions, and there was little indication of clinically 
meaningful (as opposed to statistical) increases in blood volume in her 
hands. Although it is not known if further training would have yielded 
large enough changes to constitute therapeutie success, this seems un- 
likely since she appeared to have a severe case of the disease (she was 
being considered for a sympathectomy); and on top of this, we had little 
Success in improving her attitude and reactions toward the whole situa- 
tion. Her experience illustrates how personality and individual differ- 
ences make it impossible to provide a simple "cookbook" approach to 
questions such as, What should I tell the patient? or What kinds of in- 
centives should I provide? On the other hand, to the extent that specific 
instructions direct a patient's attention to certain aspects of his behavior 
or lead him to utilize certain Strategies in learning control, it can be 
speculated that what the patient is told and what he is asked to do may 
depend on both the Specifie natur 


e of the desired function(s) to be ch 
and the personality of the patient.5 (s) changed 


SUMMARY AND CONCLUSIONS 


Biofeedback is clearl i i 
І y an important discovery, one that may prove t 
have some value in medical and psychological {теа ve 6 


tment. Н 

аге many problems that need to be solved, particularly т 
the individual patient. Biofeedback, if used in conjunction with other 
medical and psychological techniques, may be useful with certain pa- 
tients. However, in the face of specific biological and ое 
constraints, I am somewhat pessimistic about its application to chronic 
physical disease, particularly in the absence of other therapeutic proce- 
dures.” Emphasizing the problems involved is : 


i le not meant to discoura; 
experimental attempts at clinieal application. Although a combined 
behavioral-biological approach that emphasizes (a) the natural relations 


between responses, (b) the exact manner that the feedback and reward 
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is given, and (с) biological, cognitive, and environmental constraints is 
indicated, it is not implied that all of these factors need to be taken into 
account before using biofeedback in a specific case. Rather, the purpose 
of the article is to illustrate potential areas of inquiry and evaluation that 
may be clinically useful. 

Even if direct voluntary control of certain bodily functions or patterns 
of functions proves to have little therapeutic value, biofeedback per se 
can still serve an important function—to signal both the therapist and 
the patient that the patient is currently thinking, feeling, or doing spe- 
cific things that are detrimental to his physical or emotional health. In 
the same way that a scale helps direct the therapist and his obese patient 
in learning how to reduce food consumption and/or increase exercise in 
order to control weight, biofeedback for visceral and neural disorders 
may be so employed as well. By means of immediate, augmented feed- 
back (with its associated increased bodily awareness), the patient may 
be able to learn new ways of coping behaviorally with his environment, or 
he may be able to alter his lifestyle in such a way as to keep his physiologi- 
cal processes within safer limits. In this respect, biofeedback is really simi- 
lar to current psychotherapies, for they all provide corrective feedback.? 


REFERENCES 


Barber, T. X., DiCara, L. V., Kamiya, J., Miller, N. E., Shapiro, D., & Stoyva, J. (Eds.) 
Biofeedback and self-control, 1970: An Aldine annual on the regulation of bodily 
processes and consciousness. Chicago: Aldine-Atherton, 1971. 


Benson, H., Shapiro, D., Tursky, В., & Schwartz, б. Е. Decreased systolic blood pressure 
through operant conditioning techniques in patients with essential hypertension. 
Science, 1971, 173, 740-742. 

Braunwald, E., Epstein, S. E., Glick, G., Wechsler, A. S., & Braunwald, N. S. Relief of 
angina pectoris by electrical stimulation of the carotid-sinus nerves. New England 
Journal of Medicine, 1967, 227, 1278-1283. 

Crider, A., Schwartz, G. E., & Shnidman, S. R. On the criteria for instrumental autonomic 
conditioning: A reply to Katkin and Murray. Psychological Bulletin, 1969, 71, 455-461. 

DiCara, L. V. Learning їп the autonomic nervous system. Scientific American, 1970, 222, 
30-39. 

Goesling, W. J., & Brener, J. Effect of activity and immobility conditioning upon subse- 
quent heart rate conditioning in curarized rats. Journal of Comparative and Physiologi- 
cal Psychology, 1972, 81, 311-317. 

Kamiya, J., DiCara, L. V., Barber, T. X., Miller, N. E., Shapiro, D., & Stoyva, J. (Eds.) 
Biofeedback and self-control: An Aldine reader on the regulation of bodily processes 
and consciousness. Chicago: Aldine-Atherton, 1971. 

Katkin, E. S., & Murray, E. N. Instrumental conditioning of automatically mediated 
behavior: Theoretical and methodological issues. Psychological Bulletin, 1968, 70, 
52-68. 

Lacey, J. Somatie response patterning and stress: Some revisions of activation theory. 
In M. H. Appley & R. Trumbull (Eds.), Psychological stress. New York: Appleton- 
Century-Crofts, 1967. 

Lazarus, A. A. Behavior therapy and beyond. New York: McGraw-Hill, 1971. 

Maslach, C., Marshall, G., & Zimbardo, P. G. Hypnotic control of peripheral skin tempera- 
ture: A case report. Psychophysiology, 1972, 9, 600-605. 


82 CONTROLLING OUR EMOTIONS: BIOFEEDBACK AS THERAPY 


Miller, N. E. Learning of visceral and glandular responses. Science, 1969, 163, 484—445. 

Obrist, P. A., Webb, R. A., Sutterer, J. R., & Howard, J. L. The cardiac-somatic relation- 
ship: Some reformulations. Psychophysiology, 1970, 6, 569-587. 

Schwartz, С. E. Cardiac responses to self-induced thoughts. Psychophysiology, 1971, 8, 
462—467. 

. Voluntary control of human cardiovascular integration and differentiation through 

feedback and reward. Science, 1972, 175, 90-93. 

. Toward a theory of voluntary control of response patterns in the cardiovascular 

system. In P. A. Obrist, A. H. Black, J. Brener, & L. V. DiCara (Eds.), Cardiovascular 

psychophysiology. Chicago: Aldine, in press. 


Schwartz, б. E. Cardiac responses to self-induced thoughts. Psychophysiology, 1971, 8, 
462—467. 


Schwartz, б. E., & Shapiro, D. Biofeedback in essential hypertension: Current findings 
and theoretical concerns. Seminars in Psychiatry, in press. 


Schwartz, G. E., Shapiro, D., & Tursky, B. Learned control of cardiovascular integration 
in man through operant conditioning. Psychosomatic Medicine, 1971, 33, 57-62. 
‚ Self-control of patterns of human diastolic blood pressure and heart rate through 
feedback and reward. Psychophysiology, 1972, 9, 270 (abstract). 


Schwartz, G. E., Shaw, G., & Shapiro, D. Specificity of alpha and heart rate control 
through feedback. Psychophysiology, 1972, 9, 269 (abstract). 


Shapiro, D., Barber, T. X., DiCara, L. V., Kamiya, J., Miller, N. E., & Stoyva, J. (Eds.) 


Biofeedback and self-control, 1972: An Aldine annual on the regulation of bodily 
processes and consciousness. Chicago: Aldine, 1973. 


Shapiro, D., & Schwartz, G. E. Biofeedback and visceral learning: Clinical applications. 
Seminars in Psychiatry, 1972, 4, 171-184. 


Shapiro, D., Schwartz, G. E., & Tursky, B. Control of diastolic blood pressure in man by 
feedback and reinforcement. Psychophysiology, 1972, 9, 296-304. 


Shapiro, D., Tursky, B., Gershon, E., & Stern, M. Effects of feedback and reinforcement 
on the control of human systolic blood pressure. Science, 1969, 163, 588-589. 
Shapiro, D., Tursky, B., & Schwartz, С. E. 


Diy Control of blood pressure in man by operant 
conditioning. Circulation Research, 1970: 


а, 26 (supplement 1), 27, I-27 to I-82, 


5 Differentiation of heart rate and blood pressure in man by operant conditioning. 
Psychosomatic Medicine, 1970b, 32, 417-423. 


Stoyva, J., Barber, T. X., DiCara, L. V., Kamiya, J., Miller, N. E., & Shapiro, D. (Eds.) 
Biofeedback and self-control, 1971: An Aldine annual on the regulation of bodily 
Processes and consciousness, Chicago: Aldine-Atherton, 1972. 

Tursky, B., Shapiro, D., & Schw 
measure average systolic and 
Biomedical Engineering, 


artz, С. Е. Automated constant cuff pressure system to 


diastolic blood pressure in man. IEEE Transactions on 
1972, 19, 271-275. 


ue A Physiological effects of transcendental meditation. Science, 1970, 167, 


Weiss, T, & Engel, B. T. Operant conditionin: 
ventricular contractions. Psychosomatic M 


NOTES 


g of heart rate in patients with premature 
edicine, 1971, 33, 301-321. 


(feedback) and incentive (reward) value is discussed in Schwartz (in press) and Schwartz 
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3. These findings have been extended, and predictions of the model replicated, in current 
research on the voluntary control of diastolic blood pressure and heart rate (Schwartz, 
Shapiro, & Tursky, 1972; Shapiro, Schwartz, & Tursky, 1972). The approach also has been 
applied to the voluntary control of EEG alpha and heart rate (Schwartz, Shaw, & Shapiro, 
1972), but this material will not be reviewed here. 


4, Although there is good psychophysiological evidence for believing that heart rate and 
somatic activity are centrally integrated (Obrist, Webb, Sutterer, & Howard, 1970), this 
does not imply that other autonomic responses are also so integrated. 


5. Specific cognitive control of skin temperature (relative dilation in one hand and con- 
striction in the other through hypnosis and imagery of warmth and cold) has recently been 
reported by Maslach, Marshall, and Zimbardo (1972). Further evidence for cognitive 
control of cardiovascular activity due to specific thought processes can be found in Schwartz 
(1971) and Schwartz and Higgins (1971). 


6. For example, if a subject is instructed to close his eyes and to think a neutral word 
over and over in a relaxed fashion, a reduction in a number of psychophysiological responses 
(a pattern) may occur, such as those seen in certain forms of meditation (Wallace, 1970). 

7. Learning to relax a chronically tense (but otherwise healthy) muscle with biofeedback, 
for example, would seem to be a more likely candidate for success. Also, in this case the 
response in question is likely to be already under some voluntary control since it is part of 
the motor system. This point would similarly apply to learning to control certain electro- 
encephalographie rhythms, to the extent that they involve self-regulatory sensorimotor 
processes. 


8. It might be noted that this approach to the use of biofeedback removes the negative 
clinical implications suggested by recent difficulties in replicating large magnitude effects 
in the curarized rat (N. E. Miller & L. V. DiCara, personal communication, 1972). Rather 
than trying to eliminate those overt somatic “mediators” capable of producing large au- 
tonomie changes, the goal here is to deliberately use them to produce more physiologically 
adaptive reactions (e.g., through breathing exercises or other yoga procedures). 
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SELF-CONTROL OF CARDIAC FUNCTIONING: 
A PROMISE AS YET UNFULFILLED 


Edward B. Blanchard, Tennessee Psychiatric Hospital 
Larry D. Young, Harvard University 


The research in the operant conditioning or self-control of four cardiac 
functions—heart rate level, heart rate variability, blood pressure, and 
cardiac arrhythmias—is reviewed. Although the fact of the self-control 
of these functions seems well established, with few exceptions the magni- 
tude of change is statistically significant rather than clinically signifi- 
cant. A few studies, all involving numerous training sessions, have 
reported large magnitude changes. Several methodological issues are 
raised and discussed. Finally, alternative modes of achieving self- 


control of cardiac function, such as progressive relaxation and Yoga 
exercises are discussed. 
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consideration by this review are studies by Shearn (1962), Frazier (1966), 
and Brener (1966), all of which used a shock avoidance procedure. Basi- 
cally, these studies allowed the subject to avoid mild but painful electrical 
shocks by emitting a heart rate of the appropriate magnitude. These 
studies, which did show that subjects could learn some degree of heart 
rate control, are omitted because such a paradigm was not thought to 
be relevant for clinical practice. 

Inspection of Table 1 reveals that each experimenter has tended to 
use his own individual method of determining if subjects can change their 
heart rate. Although with three exceptions the magnitude of the change 
in heart rate achieved is quite small, ranging 1-7 beats per minute, 
nevertheless it seems fairly well established that small but statistically 
significant changes in heart rate can be obtained through giving subjects 
feedback or reinforcement. However, these changes, though of sufficient 
magnitude to be statistically significant, are not clinically significant. 
For the purpose of this review, a change in response is considered to 
be clinically significant if it is equal to either 20% of the base rate or, 
in the case of an abnormally high response, the change is such as to return 
the response to the normal range. 

The three studies which have demonstrated large magnitude changes 
in heart rate include a single subject whom Headrick, Feather, and Wells 
(1971) ran for 14 sessions and who demonstrated within-trial increases 
in heart rate of 30 beats per minute for the short 1-minute trial. Recently 
Stephens, Harris, and Brady (1972) have reported on four normal sub- 
jects who were run under a complex combination of feedback and rein- 
forcement conditions for 10-18 sessions of trials varying 5—60 minutes. 
In all four subjects clinically significant within-trial increases in heart 
rate were obtained, ranging 15-29 beats per minute; moreover, two of 
the subjects were able to decrease their heart rate 5-8 beats per minute 
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their heart rate was in the appropriate direction relative to a criterion 
heart rate. Stephens, Harris, and Brady (1972) followed a similar pro- 
cedure. 

Most of the other studies of heart rate change are described in terms 
of self-control established through external sensory feedback. It is pos- 
sible, of course, to construe all of these studies as examples of operant 
conditioning in which the reinforcement is knowledge of having success- 
fully emitted the correct response. On the other hand, it is equally possi- 
ble to construe the reinforcement, especially that used by Bergman and 
Johnson (1972), in informational terms as binary feedback, since subjects 
receive reinforcement if they are correct and do not receive it if they 
are not making the correct response. 

If one considers knowledge of results to be a reinforcer, feedback and 
reinforcement are inextricably confounded in the feedback studies. 
Therefore, it is not possible to settle the question by merely giving one 
group of subjects feedback and a second group reinforcement and then 
comparing their performance. However, it would be possible to deter- 
mine if the reinforcement used by Engel has an effect above that achieved 
by feedback by comparing the performance of groups of subjects run 
in Engel’s paradigm with an without monetary reinforcement. 


Type of Feedback 


There are two dimensions in the classification of types of feedback 
used. The first is sensory modality. Feedback has been given to subjects 
by either auditory or visual means, or by both, with no apparent rationale 
given for preferring one mode to the other. Blanchard and Young (1972), 
in their study of the relative efficacy of these two modes, made both 
within- and between-subject comparisons and found no statistical ad- 
vantage for one sensory modality over the other. Thus it would seem that 
the decision as to which sensory modality one should use, particularly 
in a clinical procedure, must rest on factors other than efficacy. 

In describing the second classification dimension, we have introduced 
some new terminology which may require explanation. By binary feed- 
back we refer to a situation such as that in studies by Engel and his asso- 
ciates and by Brener and his associates wherein the subject’s heart rate 
in terms of interbeat intervals, the reciprocal of heart rate, is compared 
to some criterion on a beat-by-beat basis. After each comparison the 
subject is automatically informed as to whether or not he has met or 
exceeded the criterion by the presence or absence of the feedback signal, 
respectively. This class of feedback has been delivered in either the 
auditory mode (Brener, Kleinman, & Goesling, 1969) or the visual mode 
(Engel & Chism, 1967b; Engel & Hansen, 1966). 

By proportional feedback we refer to a situation such as that in studies 
by Finley (1971), Sroufe (1971), Blanchard, Young, and McLeod (1972) in 
which the subject is provided with direct knowledge of heart rate on a 
beat-by-beat basis. With this kind of feedback the subject is told not only 
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if he is right or wrong as in binary feedback but also by how much. At 
the time of this review, no one has compared binary and proportional 
feedback with regard to relative efficacy. 


Knowledge of Response by the Subject 


Whether or not the subjects should be informed of the particular re- 
sponse which they are to change is an issue in the field of heart rate 
control which provokes almost as much mixed opinion as the particular 
feedback arrangement to use. From Table 1 one can see that researchers 
are fairly well divided as to whether or not to inform the subjects of the 
nature of the response. For example, Brener changed his policy on in- 
forming the subjects: in his first two studies (Brener & Hothersall, 1966, 
1967) the subjects were not informed; in his later study (Brener et al., 
1969) they were informed. 

This difference of opinion seems to date back to the early work of Engel 
and his associates. For reasons which are unclear in their reports, they 
elected not to inform subjects what response was related to the feedback 
and reinforcement. Engel and Hansen (1966) found, post hoc, that sub- 
jects who did not correctly infer the response that was related to the 
feedback and reinforcement contingency showed evidence of learning 
the heart-rate-slowing response whereas four of their five nonlearners 
correctly guessed that heart rate was the response. In a second study 
(Engel & Chism, 1967b), although all five subjects showed evidence of 
heart rate speeding, four of the five subjects did not know the nature 
of the response. 

Bergman and Johnson (1972) have taken a step toward resolving this 
issue by treating awareness of the response as an independent variable. 
Of two groups of subjects receiving no feedback of heart rate, those in- 
formed of the response raised it significantly higher (AX = 1.8 beats 

per minute) than uninformed subjects trying to control an unspecified 


response. Interestingly, a third group of subjects who were both in- 
formed and given feedback in the form of heart sounds did no better than 
the informed, no-feedback subjects. 

One would hope that further research in this area will show that knowl- 
edge of the response being changed, at the very least, has no detrimental 
effect since in a clinical a к 


| pplication of these procedures one would prefer 
to be able to inform patients fully about the procedure. " 
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A second policy on control groups, which has been followed by Engel 
and Hensen (1966) and Engel and Chism (1967b), is the use of yoked 
controls. In this procedure the control subject receives the same feedback 
signal as the experimental subject to whom he is yoked. A similar form 
of false feedback has been used by Finley (1971). 

Church (1964) has pointed out the statistical and methodological falla- 
cies inherent in the use of yoked controls. Examination of the results for 
subjects given some form of false feedback reveals that the performance 
of these yoked control subjects tends to deteriorate, that is, their re- 
sponses are in the wrong direction. For example, in the study of heart 
rate slowing by Engel and Hansen (1966) the control subjects achieved 
a mean increase in heart rate of 5.6 beats per minute. Furthermore, 
Finley’s (1971) control subjects showed an increased heart rate (AX = 
.8 beats per minute) when they received false feedback as contrasted to 
times when they received no feedback. Comparisons of changes in heart 
rate for experimental and control subjects thus yield spuriously large 
effects because of capitalizing on differences which include the deteriora- 
tion of the controls. 

The appropriate control condition would be one in which subjects re- 
ceive the same instructions as the experimental subjects but no feedback. 
The inclusion of an instructed, no-feedback control has been followed 
by Blanchard and Young (1972), Bergman and Johnson (1972), Brener 
et al. (1969), Johns (1970), and Sroufe (1971). 


Changes in Heart Rate in the Absence of Feedback or Reinforcement 


The use of a no-feedback condition enables one to detect if subjects 
can significantly change their heart rate in the absence of external feed- 
back or reinforcement. The results on this point are mixed. Bergman and 
Johnson (1971) found both significant increases and decreases for very 
brief (4—6 seconds) trials in the absence of any feedback. In a later study, 
described above, Bergman and Johnson (1972) found that subjects in- 
structed to increase heart rate but given no feedback did as well as simi- 
larly instructed subjects who received a form of auditory feedback. 

Although the subjects of Brener et al. (1969) who received feedback 
on 100% of the trials showed significantly greater heart rate control 
than those subjects who received no feedback, the latter group did show 
а small (AX = 2.4 beats per minute) but significant difference between 
raise and lower trials. However, since the results were reported as dif- 
ferences, no directional statement can be made from their data. Both 
Johns (1970) and Blanchard and Young (1972) found that their no-feedback 
control subjects could lower heart rate but could not raise it (X = -2.3 
beats per minute, Х = —2.2 beats per minute, respectively). At this 
point, one cannot conclude definitely that feedback enables subjects to 
change heart rate in either direction more than subjects who are similarly 
instructed but receive no feedback. 

Another way of viewing this question is to examine how well subjects 
who have been trained to control heart rate with feedback can control 
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their heart rate in the absence of feedback. This is a point of considerable 
clinical significance since transfer of the self-control that patients learn 
in an experimental setting to their natural environment is essential. 
Brener et al. (1969) found significantly greater generalized self-control 
for subjects given feedback on 100% of trials than to subjects given no 
feedback. 

Sroufe (1971), working primarily with heart rate variability (to be 
discussed later), has achieved similar results. His subjects, who were 
taught to control heart rate variability through paced respiration and 
who were given proportional visual feedback, showed significant transfer 
of self-control to a no-feedback state after only a brief (5-minute) training 
period. 


Respiratory Effects 


An issue in the field of self-control of cardiac function surrounded by 
much controversy is the relative importance of mediational variables. 
Although mediation by means of cognitive or unspecified skeletal mus- 
cular mechanisms has been suggested, the primary focus of the mediation 
controversy has been on the effects of respiratory variables. Experi- 
mental procedures designed to answer whether or not the obtained heart 
rate change resulted from respiratory changes fall into four general 
categories. Although no investigator failed to monitor respiration, some 
did little else. Brener and Hothersall (1966) and Brener et al. (1969) 
report monitoring respiration but do not mention any instructions con- 
cerning breathing, nor do they report any analyses of the observed 
differences. 

Blanchard and his associates (Blanchard et al 
Young, 1972) and Engel and his associates 
& Hansen, 1966; Levene et al. 
tions the explicit instructions 
normal. Again, however, 
data were used to reach t 
significantly and that the 
from respiratory changes. 


1972; Blanchard & 
(Engel & Chism, 1967b; Engel 
, 1968) gave all subjects in their investiga- 
not to change their breathing pattern from 
only monitoring and visual inspection of the 
he conclusions that respiration did not change 
observed heart rate differences did not result 
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a pacing technique. This resulted in changes in heart rate variability 
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appears to deteriorate with any session when the subject is attempting 
to control two systems simultaneously. 

The final method used (Finley, 1971; Headrick et al., 1971; Hnatiow 
& Lang, 1965; Lang, Sroufe, & Hastings, 1967) to ensure that changes in 
heart rate were not the result of respiratory differences is to monitor, 
analyze, and control statistically through covariance analyses any sys- 
tematic variations in respiration that are found. There have been no 
significant relationships between heart rate change and respiratory 
changes with two minor exceptions: Headrick et al. (1971) found a sig- 
nificant correlation between breathing amplitude and heart rate for only 
one subject; and Hnatiow and Lang’s (1965) single significant correlation 
was between average heart rate deviation during the final no-feedback 
period and the average inspiration to total cycle ratio for the final feed- 
back period. 

A summary of cardiorespiratory relations is not complete without the 
inclusion of two studies which focus specifically on them. In the earlier 
investigation (Engel & Chism, 1967a) 20% changes in respiration rate 
sustained for a 10-minute period did not cause changes in average heart 
rate. However, an inverse relation was noted between respiration rate 
and heart rate variability. Sroufe’s (1971) series of three experiments 
was broader in scope since he manipulated respiratory volume in addition 
to respiration rate. His results confirmed the earlier finding of Engel 
and Chism (1967a) that changes in respiration rate affect only heart rate 
variability with increases in respiration rate producing a more stable 
heart rate. However, changes in respiratory volume affected both the 
level and variability of heart rate with deeper breathing producing a 
higher but more variable heart rate. 

In light of the evidence, monitoring of respiratory variables seems 
mandatory for those investigations attempting to show heart rate con- 
trol. However, monitoring alone is not sufficient; the effects of respira- 
tory changes must be controlled either experimentally or statistically 
to eliminate this possible source of confounding. As Sroufe (1971) has 
so aptly observed, “The fact that cardiac control is possible with respira- 
tion experimentally controlled merely suggests that subjects can control 
rate independent of respiration, not that respiration is of minor impor- 
tance [p. 654].” 

Sroufe’s (1971) research carries several implications and suggestions 
for subsequent research in the area of heart rate self-control, particularly 
clinical studies: Since respiration and skeletal muscle activity are more 
diseriminable than cardiac activity, they might profitably be used early 
in training and gradually faded as the desired response is acquired. As 
Sroufe (1971) showed, utilization of these or similar techniques improved 
the transfer of the learned cardiac control from periods where feedback 
is present to periods where it is absent: ultimately, utilization improved 
the first step in establishing generalization from the experimental cham- 
ber to the natural environment. 
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HEART RATE VARIABILITY 


Lang and his associates have been the principal researchers investi- 
gating the self-control of heart rate variability. All of their studies have 
used the same basic experimental procedure: The experimental subjects 
are given proportional visual feedback of their heart rate and asked to 
keep it within +3 beats per minute of a set point (their mean preexperi- 
mental heart rate). Dependent measures are the standard deviation of 
the distribution of interbeat intervals and time on target (within the + 
beats per minute range). 

In the first study (Hnatiow & Lang, 1965) experimental subjects per- 
formed significantly better than yoked controls who received false feed- 
back and who were told only to track the visual display, with no mention 
of controlling it. 

The second study (Lang et al., 1967) was a refinement of the first to 
include additional controls. Subjects in the experimental group again 
received proportional visual feedback of heart rate, were told what the 
feedback display represented, and were instructed to try to keep their 
heart rate within a 6 beat per minute range. Those in one control group 
received the same instructions but the feedback display of their yoked 
experimental partner. The other two control groups were not informed 


concerning what the visual display represented and were told only to 
track the visual display. 


The experimental 


perimental subjects improved 
ack-available conditions. The 


(Sroufe, 1969) subjects again were 
al visual feedback of heart rate and 


Interestingly, a final Study in this series (Hnatiow, 1971) failed to 
replicate the early findings of better control by subjects receiving feed- 


back than the yoked controls in spite of using identical equipment and 
procedures. 
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To date, Lang and his associates have reported no clinical application 
of their procedure. In fact, all of their work has been with normal college 
students. 


BLOOD PRESSURE 


In Table 2 are summarized the studies involving change in blood pres- 
sure as the dependent variable. 

In all of their blood pressure control studies, Shapiro and his colleagues 
at Harvard have used the same basic experimental procedure: A subject 
receives binary feedback, both visual and auditory, as to whether his 
blood pressure is at or in the appropriate direction away from a criterion 
blood pressure value on each heart beat; in addition to this sensory feed- 
back the subject receives a reinforcement of a brief presentation of a 
slide of either a nude, a landscape, or an indication of small financial 
reward on some fixed-ratio schedule for a blood pressure response in the 
appropriate direction. 

The method of delivering this feedback involves inflating an arm cuff 
to a pressure very close to either systolic or diastolic blood pressure, 
depending on which is of interest, then varying the pressure in small 
increments and noting the presence or disappearance of Korotkoff sounds 
within a specified time interval after each heart beat. Feedback delivery 
is dependent upon the appropriate response. For example, in trying to 
shape a decrease in systolic blood pressure, feedback would be given the 
subject if no Korotkoff sound was detected following the R wave of the 
electrocardiogram. Had a sound been detected, indicating that the sys- 
tolic pressure was above the cuff pressure, no feedback would be given. 
Typically, subjects are given 25 trials per session, with a trial being 50 
interbeat intervals long. 

As сап be seen in Table 2, with the exception of their recently com- 
pleted clinieal trial (Benson, Shapiro, Tursky, & Schwartz, 1971) the 
magnitude of change in blood pressure has been rather small, averaging 
less than 1 millimeter of mercury per trial for subjects who were in- 
creasing blood pressure and about 2-3 millimeters of mercury per trial 
for subjects lowering their blood pressure. Using as an index of the 
efficacy of their procedure the average blood pressure on the final block 
of five trials, the average amount of increase in blood pressure has been 
1-2 millimeters of mercury while the average decrease in subjects lower- 
ing blood pressure has been 4-6 millimeters of mercury. 

Despite their impressive series of studies many of the criticisms pre- 
viously leveled at the research on self-control of heart rate can be sup- 
plied to the blood pressure self-control studies. First and foremost, it 
is not possible to tell what are the active ingredients in the Shapiro pro- 
cedure since subjects receive both binary auditory and visual feedback 
as well as reinforcement on some fixed ratio schedule. Whether rein- 
forcement contributes to the efficacy of the procedure is not known, nor 
are the relative efficacies of the types of feedback. 
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A second problem with these studies has been the failure to include а 
no-feedback control in order to evaluate the efficacy of their combination 
of procedures over merely instructing subjects to lower their blood pres- 
sure. In one study (Shapiro, Tursky, and Schwartz, 1970a) a group rein- 
forced on a random basis was included. Results showed that groups 
reinforced for raising or lowering blood pressures, while different from 
each other, did not differ from this group which was reinforced randomly. 
Thus to date, this group of investigators has shown primarily that sub- 
jects reinforced for raising blood pressure are different from subjects 
reinforced for lowering blood pressure. 

Despite the methodological problems with these studies, the Shapiro 
group certainly deserves praise for their pioneering work in this area. 
Recently they have focused on self-control of diastolic blood pressure 
(Shapiro, Schwartz, and Tursky, 1972) and the degree of integration and 
differentiation of the cardiovascular system (Schwartz, 1972; Schwartz, 
Shapiro, & Tursky, 1971). The former line of research, using the same 
basie experimental paradigm, has shown that diastolic blood pressure 
can be altered significantly by this procedure and, of much interest from 
a clinical point of view, that the changes show some degree of stability 
after the feedback and reinforcement are no longer available. In the 
latter research area a fair degree of independence of heart rate and blood 
pressure have been shown. 

The study by Benson et al. (1971) answers one of our major criticisms 
leveled at the blood pressure self-control research, namely, the exclusive 
relianee on normal subjects. Benson applied Shapiro's basie blood pres- 
sure control procedure to a group of seven hypertensive patients. Their 
average systolic blood pressure during the final base-line periods was 
165 millimeters of mercury. Clinically significant decreases in blood 
pressure were obtained in five of the seven patients (Х = 22.6 milli- 
meters of mercury). 

As with the heart rate studies which showed large scale changes in that 
response, numerous trials were necessary to obtain this magnitude of 
change (X = 21.7 conditioning sessions of 25 trials each). While no data 
are available on the stability of the change when the feedback was no 
longer available or on the degree of transfer of the decreased blood pres- 
sure outside of the laboratory, these results are provocative enough to 
warrant following up and certainly to increase the clinical promise held 
out by this research. 

As can be seen in Table 2, there has been only one important study 
of blood pressure self-control conducted by an investigator outside of 
the Shapiro group at Harvard, and that is by Brener and Kleinman 
(1970). This study, which utilized a form of proportional visual feedback, 
has several commendable features: The size of the change in blood pres- 
sure, approximately 15 millimeters of mercury in the first session and 
approximately 12 millimeters of mercury in the second, is considerably 
larger than that reported by Shapiro and his colleagues with normal 
subjects and is of a magnitude which could be clinically significant. Sec- 
ond, control subjects who received the same type of feedback of their 
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own blood pressure but were not instructed to change their blood pressure 
were used and showed no overall change in blood pressure. A further 
control group is needed, however, in which subjects, who are instructed 
to try to lower blood pressure, receive no feedback. 

The form of feedback used within the classification system presented 
previously is closest to proportional visual feedback. Subjects were able 
to view a manometer which continuously displayed pressure in the oc- 
cluding staff. The cuff pressure,, and hence the feedback received by 
subjects, oscillated about the individual subject’s systolic blood pressure. 
An interesting aspect of this procedure was the taking of blood pressure 
on the left index finger rather than upper arm. Such a procedure mini- 
mizes the problem ischaemia. 

The size changes in blood pressure obtained by Brener and Kleinman’s 
procedure are much larger than those obtained by Shapiro and his col- 
leagues іп a similar length of time on a similar population of normotensive 
college-age males. One could speculate that the major reason for this 
difference is that Brener and Kleinman’s subjects received a form of 


proportional feedback whereas Shapiro’s subjects received binary feed- 
back delivered both visually and aurally. 


CARDIAC ARRHYTHMIAS 


gible reinforcer, the au- 
than self-control through feedback, ына ee 
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alternately presented and removed; the initial ratio of 1 minute of feed- 
back to 1 minute without was extended to 1 minute on and 7 minutes off. 
Interestingly, in this final phase, subjects did as well without feedback 
as with it, indicating that true self-control had been learned. 

Four of the eight patients had a significantly reduced frequency of 
premature ventricular contractions after the training procedure as mea- 
sured in the ward and at a follow-up 3-21 months afterward. A fifth 
patient did not have a significant reduction in premature ventricular 
contractions rate but did learn to recognize their occurrence and was able 
to control them at home through resting. The other three patients showed 
no evidence of learning to control premature ventricular contractions. 

Total number of training sessions ranged 22-77 with a mean of 57.5. 
The five cases which can be considered successes all had 47 or more train- 
ing sessions. Our reanalysis of these results show that improvement is 
significantly (by p = .05, Fisher's exact probability test) related to re- 
ceiving 47 or more training sessions. 

Weiss and Engel deserve commendation for the following two related 
techniques for extending their procedures into the area of clinical appli- 
cation: first, eliminating the necessity for external feedback to control 
the cardiac dysfunction, and more importantly, monitoring premature 
ventrieular contraction rate outside the experimental chamber. The 
degree of generalization observed in this latter procedure is especially 
encouraging. Generalization of learned control of cardiac functioning to 
the natural environment, true self-control, is the desired clinical terminal 
behavior. 

There are two obvious criticisms which can be leveled at this otherwise 
excellent clinical study. The first is the lack of a controlled group of pa- 
tients who were either untreated or given some equally time-consuming 
and involving placebo task. This criticism is obviated to some degree 
because of the prolonged base-line period prior to the experimental phase 
during which premature ventricular contractions were noted. The docu- 
mented history of premature ventricular contractions ranged 3 months- 
8 years with a mean of 3.0 years. Despite this, one would hope that a next 
step would be a controlled outcome study of the procedure. 

The second criticism is the obvious confounding of several aspects of 
the training procedure. One does not know whether the entire sequence 
of training procedures is necessary or only part of it. Thus a next step 
would be to isolate which parts of the training procedure are necessary, 
facilitative, or superfluous, in order to improve efficiency. 


ALTERNATIVE METHODS OF CARDIAC CONTROL 


There are at least two additional lines of research which are related 
to the area of self-control of cardiac function: (a) the use of progressive 
relaxation as developed by Jacobson (1938) and (b) the use of various 
yogic exercises and related forms of meditation. In both of these a sig- 
nificant degree of change in various cardiac functions has been produced. 
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Progressive Relaxation 


Jacobson (1939) reported on the effects on blood pressure of training in 
and subsequent use of his system of muscle relaxation. His conclusions, 
which tended to be more qualitative than quantitative, were that there 
is a general relation between decreases in blood pressure and decreases 
in muscle activity as shown by the electromyogram and that training in 
progressive relaxation results in greater decreases in electromyogram 
activity than self-induced relaxation without training. He presented 
blood pressure data on 14 normotensive subjects, 9 of whom had been 
trained in progressive relaxation. These data, both systolic and diastolic 
blood pressure, were collected after a 15-minute adaptation period during 
which the subjects rested in a supine position and again after 45 minutes 
of self-induced relaxation. Analyses of these data by the present authors 
revealed significant decreases in both systolic blood pressure (AX = 8.0 
millimeters of mercury; F = 19.9, df = 1/12, p < .01) and diastolic blood 
pressure (AX = 7.8 millimeters of mercury; F = 11.5, df = 1/12, p <.01), 
but no differences between the trained and untrained subjects in degree 
of reduction achieved. It should be noted that the magnitude of change 
in blood pressure in this study exceeds that reported by Shapiro and his 
colleagues in any of their studies of normotensive subjects (see Table 2) 
and compares favorably with the results reported by Brener and Klein- 
man (1970). 

In a second study, Jacobson (1940) reported decreases in heart rate 


associated with relaxation training. Heart rate was measured initially 
after a 15-minute adaptati 


А In a more recent study, Paul (1969) reported on physiological changes 
in college-age females who Were treated for 30 minutes by either (a) a 


brief form of progressive rel (b) hypnosis induction and 
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It thus seems that training in deep muscle relaxation, even of an ab- 
breviated form, can lead to decreases in heart rate comparable to those 
achieved by most of the studies in the biofeedback area. This finding 
leads us to speculate that the operant conditioning or self-control pro- 
cedures are possibly only inefficient methods of teaching subjects to 
relax. 


Yoga 

Although many claims of phenomenal degrees of self-control among 
Yogis had been made, few scientific data were collected on this subject 
until the 1950s when Wenger and his associates (Wenger & Bagchi, 1961; 
Wenger, Bagchi, & Anand, 1961) made systematic autonomic recordings 
from over 50 Yogis in India. In one study of three Yogis who claimed to 
be able to stop their hearts, Wenger et al. (1961) found (a) absence of 
radial pulse, (b) absence of most heart sounds, (c) blood pressure rad- 
ically increased, but (d) no stoppage of the heart and, in fact, (e) very 
little slowing in heart rate. They concluded that these subjects were 
utilizing thoracic muscle interference with venous return which led to a 
radical decrease in blood flow. 

A fourth Yogi claimed only to be able to slow his heart rate and did so, 
demonstrating increases in interbeat intervals from a base line of approx- 
imately 1.0 second to slightly over 2 seconds for several beats (equivalent 
to changes in heart rate from 60 beats per minute to 30 beats per minute.) 
The authors concluded that no muscle interference was apparently used; 
rather, the alterations seemed to be due to vagal innervation. 

A similar example of voluntary cardiac arrest was reported by McClure 
(1959) in the case of a 44-year-old Caucasian male. This patient, who 
showed no cardiac irregularities, could voluntarily go into cardiac arrest 
for periods of approximately 5 seconds, increasing the interbeat interval 
of his heart rate from normally 1.0 second to over 5 seconds. Although 
the patient did not know the mechanism involved, McClure concluded 
that it was due to vagal innervation and was entirely unrelated to respira- 
tion rate. { 

Three systematic studies of the effects of Yoga exercises and medita- 
tion on cardiac function have recently been reported. Datey, Deshmukh, 
Dalvi, and Vinekar (1969) studied the effects of training in the yogic 
exercise called Shavasan on 47 hypertensive patients. The patients, 
whose original blood pressures were in the range 160-270/90—145, all 
received training in Shavasan for unspecified lengths of time. The typical 
training course was described as requiring 3 weeks of daily 30-minute 
sessions. Results were reported for three groups of patients; (a) 10 not 
on medication; (b) 22 whose hypertension was controlled by drugs; and 
(с) 15 whose hypertension was not adequately controlled by drugs. In 
all cases, blood pressure was reported as the diastolic pressure plus 
one-third of the pulse pressure. For those not on medication, blood pres- 
sure was reduced in 9 out of 10 cases with the average reduction being 
27 millimeters of mercury (р < .05). In the second group, the effects of 
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training in Shavasan were evaluated in terms of the reduction in patients’ 
drug dosage necessary to keep their blood pressure under control. In 
13 of 22 patients the drug dosage was decreased by 33% or more, with 
an average reduction for those 13 patients of 68%. In the third group, 
the effects were evaluated in the same fashion, and drug dosage was 
reduced by at least 33% in 6 of 15 patients. This study seems to show that 
a chronic course of training in yogic exercise had a clinically significant 
effect on a majority of the hypertensive patients studied and would 
warrant investigation in a controlled-outcome study. 

In the first of two studies of similar design, Wallace (1970) studied the 
physiological effects of transcendental meditation, as taught by the 
Maharishi Mahesh Yogi. Physiological responses of 15 normal college 
students, who had been practicing transcendental meditation for from 
6 months to 3 years, were monitored both for 30 minutes prior to a 20-30 
minute meditation period and during the latter period. In the five sub- 
jects for whom heart rate was measured, an average decrease of 5 beats 
per minute was observed. It was not stated, however, whether or not 
the magnitude of change was statistically significant. 

In a more elaborate study of similar design, Wallace, Benson, and 
Wilson (1971) monitored physiological responses of 36 subjects who had 
been practicing transcendental meditation for an average of 29 months. 
The principal results of comparisons between the meditation period and 
the 30 minutes immediately preceding, during which subjects were rest- 
ing comfortably, were significant changes in electroencephalogram and 


— лаш There was also а small but significant (p < .05) 
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alternative modes of obtaining changes in cardiac functioning—such as 
training in progressive relaxation, transcendental meditation, or various 
Yoga exercises—may prove to be more fruitful approaches to obtaining 
clinically significant changes than the operant-conditioning-feedback 
techniques reviewed in the main body of this paper. Only comparative 
outcome studies, conducted on a chronic basis, can answer this question, 
but certainly it is deserving of an answer. 
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4. BEHAVIOR 
MODIFICATION: 


A Brave New World 
or 1984? 


In the last few years there has been a concerted effort on the part of 
psychologists to make explicit something that has been known but not 
always understood: that behavior will usually change as a function of its 
result. In the field of psychology, 
referred to, behavior mod) has d 
topic, raising both blood pressur 
to the underlying controversy. 
of the following two articles, р 
modifiers to develop an in- 


to be? 


The End of Ideology in Behavior Modification m 


E on the other hand, is concerned about the “views about the 
oa of man" implied by behavior theory, the same theory that London 
s sses as not very good, not very useful—actually not even theory! 
о wonder behavior modification is a controversial topic! As Bandura 
eee out, what we believe about human nature is very important, both 
© а: research and for current applications: “What we believe man 
"a " affects which aspects of human functioning we study most thor- 
- ghly and which we disregard. Premises thus delimit research and are, 
п turn, shaped by it. As knowledge gained through study is put into 
practice, the images of man on which social technologies rest have even 
vaster implications." 
Ви. you think Bandura would accept London’s view that behavior mod- 
m — is and should be a form of psychotechnology? Why? Is there 
m ing more to be gained from probing the results of behavior mod- 
туа ee: research than simply better ways to eliminate bed-wetting? Or 
th erely finding the best technology for a variety of behavior problems 
e sum and substance of behavior modification? And what about the 
models of man that Bandura refers to? Society as a whole seems to re- 
quire such models. These change with time, sometimes rapidly, but is 
Society as a whole likely to abandon such models and operate as a vast, 
complicated but model-less technology? 
А Bandura also discusses several situations in which external reinforcers 
© not have the expected result on behavior (as with principled dissenters 
and nonconformists). What would a psychotechnologist think of these 
Situations? 
as not too far beneath the surface of Bandura’s argument is 
a x ageless debate in psychology, that of free will versus determin- 
. How does Bandura deal with this issue? What do yow think about it? 
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IN BEHAVIOR MODIFICATION 


Perry London, University of Southem Califomia 
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them, raised them as their own, and eventually restored them to their 
rightful positions as the benefactors of behavior. When they grow up, 
confer their own benefits, and spawn their own little behavior modifiers, 
they become disabused of the myth. But not entirely—there remains, at 
the very least, a tradition of deferring to the principles of learning as 
the ultimate source of all good modifications and a parallel ritual of knock- 
ing psychoanalysts, Rogerians, existentialists, and general psychiatrists 
who have not yet mastered or endorsed the jargon of respondents, oper- 
ants, and reinforcements. 

Like most myths this tale has been based on some true events and 
has proved useful for the promotion of behavioristic patriotism and for 
extending the frontiers of experimentation and the clinical and technical 
growth of the field. And it makes a good story. But enough is enough. 
The borders are secure now, the settlers are thriving, the dark untram- 
meled forests are trammeled and cut down, the stumps blown, the fields 
plowed. It is time now to build this domain, not defend it. 

Building it requires three things: first, abandoning strife with other 
modalities and devel 


offer and exchange; second, examining the constructs that underlie mod- 
ifying operations a 


У ry is likely to keep up with, This 
article argues that it is probably just as well. xa 


of learning 
ordered hu S 
ere were оп] i- 
ples that they ever referred to, all of whi y about three princ 
and a half 
in time, s 
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and operant behavior shaping (London, 1964). And they are exceptions 
only because the former is not derived from the usual learning principles, 
and cannot be, and because the latter is something everybody knew about 
anyhow but never applied ingeniously until Skinner laboriously spelled 
out its monumental implications. 


EVOLUTION OF BEHAVIOR MODIFICATION 


The early growth of behavior modification as a professional specialty 
was largely polemical and political, not theoretical, and most of its scien- 
tific hoopla evolved to serve the polemical needs of the people who made 
it up—not all of it, however, and not only polemical needs. 

The study of learning for behavior therapists, in fact, was always more 
for the purpose of metaphor, paradigm, and analogy than for strict guid- 
ance about how to operate or about what it all means. Whatever value 
theory may have for dictating laboratory procedures, therapeutic oper- 
ations have been essentially seat-of-the-pants affairs, and still are, be- 
Cause they address immediate practical problems that require solutions 
їп fact, not in principle. The disregarding of principles to see what really 
Works, in this connection, reflects the intelligence and scientific good 
Sense of therapists. The search for principles to explain what works, on 
the other hand, reflects their integrity and their anxiety—integrity 
because only fools would accept their own results without question, 
anxiety because therapists seek principles as much to increase their 
confidence as to reduce their ignorance. People also look for principles 
to help fight intellectual battles. Conventional therapies could be as- 
Saulted for their ineffectiveness, but the basis for offering new ones had 
to be more than complaints about the old ones. And in the case of be- 
havior modification, it had to be more than a simplistic appeal to “what 
Works." Of all popular therapies, psychoanalysis in particular was based 
Оп a pretentious, respectable, and smart theory. It could not be chal- 
lenged with less than a theory. 3 Р 

Learning theory was an obvious choice: first, because there is a history 
of suggested therapeutic applications of learning principles, from Thorn- 
dike and Pavlov through Mowrer and Guthrie; and second, because the 
heart of any psychotherapy is changing people’s behavior without chang- 
Ing their body structure or gross functions—which means, in plain lan- 
guage, teaching them and getting them to learn. — 

Even so, the evolution of behavior modification did not go so neatly. 
It could not make much use of learning theory, except for the broadest 
Principles of learning, because most of the principles either do not yet 
have any really systematic application to disorders or because they are 
much more in dispute than naive students realize. | и 

Тће polemics actually used attacked evident flaws in psychoanalytic 
and other psychiatric formulations, sometimes also getting a little pro- 
fessional jealousy of psychologists for doctors into the act. Their main 
Points were (a) an attack on “the medical model"; ( b) the insistence that 
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their symptoms; and finally, (d) the demand that even the name of the 
game should be changed from psychotherapy to behavior therapy, so 
no one thinks we are mentalistic or unscientific, as other therapies obvi- 


This description is not exaggerated at all. Eysenck's (1959) tabulation 
of "the more important differences between Freudian Psychotherapy and 
Behavior Therapy," for instance, says that Freudian therapy is “based 
on inconsistent theory never properly formulated in postulate form," 
While behavior therapy is “based on consistent, properly formulated 
theory leading to testable deducations.” Similarly, Freudian practice 
is “derived from clinical observations made without necessary control, 
observation or experiment,” while behavior therapy is “derived from 
experimental studies Specifically designed to test basic theory and deduc- 
tions made therefrom [p. 67].” Franks (1969) is kinder than Eysenck, 
but even he Says a decade later, “for most behavior therapists, the pre- 


therapists, in which the Sequence is often reversed [p. 3]." 


rely pointless, but they are not entirely 
3 pt who is smart and who will get the last 
word. What makes them largely irrelevant is that explanatory concepts 


erative medical conditions like heart disease 


| Тһе controversy about (learned Versus structural) origins of disorder 
15 mostly pointless because it is mostly irrelevant to treatment. The 
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behaviorist attack commonly assumes that a genetic or biochemical view 
of etiology is bad because if disorder is learned, then treatment must be 
a learning process. This is nonsense. Etiology and treatment have no 
logical connection in either direction. Anxiety, for example, may be the 
result of prior experience but can be alleviated chemically; or it may be 
aroused by essentially chemical circumstances (like fatigue plus mild 
arousal to danger) but can be soothed physically or verbally (Davison, 
1968). 

The only treatment question of relevance is that of the functional rela- 
tionship between the problem and its solution. The process is what mat- 
ters and nothing else. 

The biggest polemic, of course, about treating symptoms versus causes 
has occurred largely because of the uncertain danger of symptom return 
or symptom substitution. Everywhere, much confusion has occurred, 
mostly because of confusion about the medical model—in this instance, 
about the exact meaning of symptom. If symptom means trouble, surely 
it should be treated. If not, maybe not. The polemic has been around the 
question of symptoms as “the main trouble”; this is because of the wide- 
Spread belief, largely from psychoanalysis, in symptom return or sub- 
Stitution. (Buchwald & Young, 1969, point out, incidentally, that some 
analysts, such as Fenichel and, later, Alexander, thought that symptoms 
should be treated directly.) 

The real issue, in any case, is the consequence of the treatment for the 
person's condition—or his life. Sometimes too, the issue is prophylactic 
or preventive rather than ameliorative. This becomes clear by comparing 
the infectious disease and chronic ailment models in medicine. Behavior 
disorders are more like chronic ailments, where medical treatment gen- 
erally aims at the symptoms only. The control of diabetes, on the other 
hand, is not treatment of symptoms, but their prevention. Is the shaping 
of behavior likewise? For changing the table manners of psychotics’ ward 
behavior, maybe not; for molding children’s manners, maybe so. The 
Same question may be addressed to assertive training or to aversion 
training. Is teaching someone to talk back to his mother-in-law reducing 
his anxiety or preventing it from happening? Is teaching a homosexual 
man to attend to women when aroused and to feel repugnance for sex 
with men relieving his symptom or preventing a disorder (which does 
not exist legally until he “goes for” the man), teaching a habit ан 
or unteaching one, changing a Ше style, altering a phenomenologica 
field, shifting ego boundaries, redirecting id impulses or SUperego fune- 
tions, or cathexis? And if he feels different or thinks about it differently, 
18 it still behavior therapy? | 

These are silly qnastiuns. The change in the homosexual is ls e 
things, depending on how you want to talk about them. Understan ing 
the process is what matters, not belaboring the different styles of talking 
about it. The only important question about systematic treatment is that 
of the simultaneous relevance of the treatment technique to the person’s 
manifest trouble and to the rest of his life. 
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is suici Some- 
ometimes it accounts for both at once—as when he is suicidal. r 

cue it accounts for the rest of his life but not for the sp oer ge 
is the behavior modifier's claim against insight types’ preoccupation a 
motivation. Sometimes it accounts for only the manifest ще 
is the insight therapists’ accusation against behavior modifiers, M 
on the specious presumption that everything in a person's life is integr 
to it. 
^ Al of these polemies were not so much meaningless as — в 
assault on psychoanalysts and existentialists was too extreme; the pond 
tifie claims on learning were too grandiose (Buchwald & Young, - 
Franks, 1969). Changing from psychotherapy to behavior therapy ed 
have been useful for people brought up on mind-body dualism, bu d 
those of us who always thought the mind hung around the brain and t d 
behavior meant "what happened," the distinetion was graceless an 

atuitous. 
P When you eliminate the polemics and politics and gratuities, Se 
what remains of theory to define the field and to tell you what it is abou : 
Not a whole lot. The definition of the field either becomes very inclusive 
1970; Paul, 1969; Skinner, 1971) 
от very narrow (Eysenck, 1960; Skinner, 1963; Wolpe, 1958). This prob- 
ably makes no serious difference to anything. As Kuhn (1962) said, “Сап 
a scientist or not [p. 160]?" 


than science. 


There is some dispute in the field about this point. The sides are well 
represented by Lazarus, who agrees with my view, and Franks, who 
does not. Lazarus (1971) said in his latest book: 


espouse some theoretical fra; 


ist, it is necessary to 
is true of the behavioral therapi: 


0 detail here, this 
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The argument that behavior modification should view itself as tech- 
nology rather than as science is not meant to denigrate the importance 
of theory, either scientifically or heuristically. Scientifically, theory is 
valuable because it directs the systematic search for new information by 
interpreting and integrating what is already known. Heuristically, theory 
is valuable because it lays our biases out in the open and by explicating 
them makes it easier for us to disavow them when they are found to be 
inadequate. 

_ The question with respect to behavior modification is twofold: First, 
is current theory very good (scientifically) in this case? Second, is any 
theory very useful for the current development of this field at this time? 
Ithink the answer to both questions is no. 
© In reality, behavior therapists never did have so much a theory as an 
ideology”; in my own paraphrase of Bell (1960) and Tomkins (1964), 
Ideology does not mean just ideas, but ideas to be acted on . . . [Lon- 
don, 1969].” What behavior therapists called theory actually served as 
bases for commitment or a rallying point for talking about disorder and 
treatment in a certain way and, more important, about acting on it within 
particular sets of limited operations, that is, technical limits. In this case 
the commitment is to the functional analysis of problems. And it is for 
this reason that the domain incorporated within the legitimate pur- 
view of the field becomes broader and broader and the polemics milder 
and milder—as it is recognized that even Rogerians and humanists and 
Psychoanalysts may analyze some things functionally and act on them 
accordingly. 
_ What results from the increasing 
increasing number of plausible methods 
proliferation of methods is tested and fo 
Very necessary to explain why they work. For practitioners, none may 
be needed even then, if they work well. In any case, theory will largely 
grow from practice (Lazarus & Davison, 1971) and practice from instru- 
Mentation, in the broadest sense; this will happen to behavior modifica- 
tion just as it did to psychoanalysis, but with two generations’ more 
experience, the results, hopefully, will be more scientific. 

The status of theory comes largely from the belief that technique de- 
velops out of theory, that is, that science underlies engineering. But 
this is only partly true even among very hard sciences, less so among 
Soft sciences, like the social and behavioral sciences, and not at all true 
for many endeavors where the existence of the technological eapacity 
and the practical need is what produces the technical application and, 
indeed, what “nourishes” much of the theoretical development itself 


(Oppenheimer, 1956). 


functional analysis of problems is an 
for coping with them. Until the 
und to work, no theory is really 


‚ This does not mean that the alternative to rude theory is rude empiri- 
cism; rather, it suggests а close-to-the-situation functional analysis in- 
Stead of a premature and precocious search for general principles from 
Which to get overextended, or for a professional ideology to which to be 
committed. Such a development seems to be proceeding now, with prac- 
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titioners using the techniques of behavior modification but declining the 
identity (Paul, 1969b) and scholars recommending heterodoxies like 
“technical eclecticism” without shame (Lazarus, 1971). The rest of this 
article spells out what this development means, so we will see clearly 
what we are doing, with the view to doing it better. 

By deducing the intellectual condition of behavior modification from 
the technology it uses, there seem to be two kinds of conceptual schemes 
which characterize it, Borrowing from Price’s (1971) book, Abnormal 
Psychology: Perspectives in Conflict, in turn derived from Kuhn's (1962) 
Structure of Scientific Revolutions, these are, first, metaphors or anal- 
ogies, and second, paradigms or models. 

In the conventional classification of how much we seem to know what 
we are talking about in science, these concepts fit the lower end of the 
Scale of certainty whose upper rungs are called theories and laws. A 
scientific law, in other words, is an idea that seems absolutely to com- 
prehend all of the facts it talks about; a theory is one that hopes to do so, 
but is not so compellingly known to do so. The activities involved in be- 
havior modification are a good distance below both these points on the 
same continuum. A few are paradigms or models, that is, ideas that seem 
to explain a group of facts pretty precisely; most are metaphors or anal- 
ogies, that is, ideas that look like they might fit a group of facts, where 
there is no clear evidence that they really do. 

In general, the treatment 


ogy are obvious, but the similarities are still 
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tions that could not be predicted from the situations from which the 
metaphor was derived in the first place. Finally, desensitization does 
the same thing that implosion or flooding does, though it appears to be 
an opposite method. 

Implosion is a better analogy than desensitization because Stampfl's 
original experiment (London, 1964) seems to parallel both Mowrer's 
(1950) theoretical statement, Solomon, Kamin, and Wynne's (1953), and 
Black's (1958) animal experiments more closely than Wolpe fits Masser- 
man's eats or Pavlov's dogs. Also, implosion technique does predict all 
of its own variants—including Watson, Gaind, and Marks’ (1971) finding 
that practice helps to extinguish phobic reactions and that using real 
objects instead of imagination as fearful stimuli also is helpful—but it 
18 metaphorical nonetheless because it is between people, and because 
it is administered in social situations where the pressure not to escape 
18 obviously coming from the thoughts that the patient has about the 
Situation, not from physical restraints on his action. 

Shaping is more exemplary or paradigmatic than conditioning treat- 
ments because it presumes less, in the first place, on the black box—mod- 
eling, likewise. Stated differently, both shaping and modeling methods 
can aim for precision in establishing the conditions that will work the 
desired effects without having to worry much about what is going on in 


the person that makes them work. | 
‚ Aversion treatment is more paradigmatic than metaphoric because 
its critical stimulus is not cognitive; that is, it hurts the patient physi- 
cally, Giving a person an electric shock is much more like shocking a 
dog than telling scary stories to a person is like blowing air up a rat’s 
behind; ergo, it is more paradigmatic than metaphoric. | . 

Neither metaphors nor paradigms are scientific theories by some dis- 
tance, but both of them are useful intellectual tools. Metaphor and anal- 
ову are good heuristically—they help to turn images into thoughts and 
inchoate hunches into articulate propositions. Paradigms are more liter- 
ally useful—they model to scale what you need to do, or give a mathe- 
тайса] formula into which you can put specific quanta. | 

Мойсе, however, that none of this has anything to do with whether 
anything works for any practical problem. Desensitization may be a poor 
deduction from conditioning, but it is a fine treatment for phobias. Be- 
avior shaping may be an excellent paradigm for getting autistic children 
9 hug people, but it does not teach them syntactical speech. 


hatever scientific concern people have about the intellectual аш 

Pa field, at this point in the development of this enterprise, none 0 
18 material is yet very important to practitioners, and it is barely = 

Portant to clinical research. What 18 important at this juncture 18 in 

evelopment of systematic practice and of a technology to sustain it. My 

e= is that, in the long run, scientific understanding will derive from 
em 


T practitioner tries to figure 0 
if; then, if he is scientifically inclined, 0‹ 
ìt does. The technologist devises machinery 


ut what will work, for the most part, 
d, looks for a way to rationalize it 
s to support him. This is 
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not a bad idea, even if it doesn’t add up to a test-tube-pure, white-coat 
science. Systematic desensitization is a good example of this process 
because by working in the first place it has given rise to technological 
refinements of itself that also work and that still leave the reasons un- 
explained. In the long run, this must force better and better logical and 
experimental confrontations with the reasons for the phenomenon— 
which success alone would not have done. 


New equipment, new drugs, new gimmickry and gadgetry should now 
be the basis for systematically developing new methods of behavior modi- 
fication and for streamlining the established techniques with controlled 
experimental testing. Instead of looking for new principles or justifying 
worn-out ones, we should look for new applications: What could we do 
to treat such-and-such if we had such-and-such machinery? What would 
be required to build it? To test it? And then, finally, to determine what 
it means? 

The critical point is that good technolog 


3 у always undermines bad 
theory. “Bad” is not meant to be 


І pejorative here; it means metaphoric. 
Precise technology reveals the empirical error in metaphoric reasoning 
and in the operations that result from it. So Lang’s (1969a, 1969b) on- 
line computer removes the therapist from the desensitization process; 
Quirk’s (1970) machine removes the Imagining and the » 

Е - : £ and the relevance of the 
phobie subject matter; and Wolpin's (Jacobs & Wolpi : 
1969; Wolpin & Raines, 1966) r olpin, 1971; Wolpin, 

2% o pm & Raines, 1966) removes the need for hypnosis, for the fear 
hierarchy, and, indeed, for relaxat; ypnosis, for 
need for explanation becomes : 
as does the inadequacy of the у i 
Wolpe (1958) on the whole пе Plausible metaphor that launched 
Technology promises in fact, m, 

: : » › More and mo» 2 t6 
paradigm, if not paradigm + more to turn metaphor 
box and doing funny things io. be a Жы Boing directly into the Lari 

i , teasing] NAE 
А ин ар We can see haus en im sel ee pes 
happening 5 7 or the former, I refer to Fidi. ore what has Т 
> po on png: Stimulation of the brain (С е pp Шо 
ее inser ol he activities undercut ара Da. di Te et 
tterns out of hand peii Patterns, either b ит the 
patverns: оп! ог ру potentiating learni y manipulating 
issue of its importance disappears. ing 50 effectively that the 
Bringing the black box out into the open is " 
ne 


to accrue from biofeedback research, and it is pr 

tion one can find of technology feeding Sion Probably the best illustra- 
hances practice жы е Other. АП of the dia With one hand as it en- 
a single tg ~ p ether they teach рео Чаек methods work on 
switch in t ү. eads or to alter their blood © to ring buzzers from a 
skin resistance by monitoring lights and к Pressure or heartbeat or 
a means of accurately externally recordin ges, The Principle is finding 
of projecting that record into the Persa ПЕ Some Internal process, then 
literally see or hear what his blooq d N’S со; Ко that he cu 
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of the main results likely 
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cles are doing. Then let him learn to manipulate his conscious sensory 
experience in whatever way people learn to do anything consciously — 
something about which we really know next to nothing. 

The clinical value of biofeedback methods remains to be seen and will 

certainly not be what alpha-machine hucksters seem to wish—instant 
cool for overheated psyches. Its clinical utility is not yet the point, only 
its technological sophistication and the scientific potential that comes 
from that. 
_ To turn the point around, I am saying that theory has worn itself out 
in behavior modification and that technology, essentially of treatment, 
Should now be a primary focus, perhaps, in the long range, even for 
Serving scientific purposes. What began with the Mowrers’ quilted pad 
in 1938 is extending now into therapeutic on-line computers, electric 
pottie chairs, electronic skin braille, and an almost self-generating brace 
of valuable hardware for therapeutic purposes. A summary of much of it 
will appear soon in a book by Robert and Ralph Schwitzgebel (in press) 
entitled Psychotechnology. 

But the proper development that I am suggesting is not limited to 
hardware and is not self-generating, but reaches to the systematic ex- 
ploration of all kinds of therapeutic things without inhibition or concern 
as to whether they fit ostensible principles of learning or reinforcement 
or whatever, but with a singular focus on whether they fit the facts of 
human experience. No one has been more forthright or resourceful in 
this connection than Lazarus (1971), whose latest work, Behavior Ther- 
ару and Beyond, goes into such things as imagery exercises for depres- 
Sion; thought control, straight out, for whatever it works for; hypnosis 
for all kinds of things; exaggerated role taking à la the late George Kelly; 
differential relaxation; covert sensitization for aversive imaging; and 
Masters and Johnson’s penis squeezing for premature ejaculation. It 
would take a genius or a madman to shelter this whole array of tech- 
niques under the intellectual umbrella of conditioning, or learning, or 
any other single psychological theory. It would take a fool to want to. 
The first issue, scientifically as well as clinically, is the factual one—Do 
they work? On whom? When? The how and why come late) 

With the era of polemics virtually ended, and Skinner g 
Freud over and over again in his latest work, and the American Psycho- 
analytie Association at a recent convention casually incorporating be- 
havior modification into its discourses on psychotherapy, the political 
utility of learning theory, so called for the definition of the field, is ended. 
It was never really theory anyhow, as we used it, but ideology for pro- 
fessional purposes and mostly metaphor for clinical ones. It is time now, 
I think, for the remedial branch of this business to stop worrying about 
its scientific pretensions, in the theoretical sense, as long as it keeps its 
functional nose clean, and to devise a kind of engineering subsidiary, or 
more precisely, a systems analysis approach to its own operations. We 
have gotten about as much mileage as we are going to out ofold principles, 
even correct ones, but we have barely begun to work the new technology. 


? 
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NOTES 


1. Oppenheimer (1956) distinguished 
larity” of analogous events implies тог 
true of metaphoric comparisons. I am ри 
them both from paradigm. 


analogy from metaphor because the “structural simi- 
e precision in their comparison than is ordinarily 
tting them together here, however, to distinguish 


BEHAVIOR THEORY AND THE MODELS OF MAN 
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The views about the nature of man conveyed by behavior theory re- 
quire critical examination on conceptual and social grounds. What we 
believe man to be affects which aspects of human functioning we study 
most thoroughly and which we disregard. Premises thus delimit research 
and are, in turn, shaped by it. As knowledge gained through study is 
put into practice, the images of man on which social technologies rest 
have even vaster implications. This is nowhere better illustrated than 
in growing public concern over manipulation and control by psychological 
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methods. Some of these fears arise from expectations that i napa 
means of influence will inevitably be misused. Other аро то is 
aroused by exaggerated claims of psychological power couche гави 
language of manipulation and authoritarian control. But most rs E 
from views of behaviorism, articulated by popular writers and by 


fI an 
orists themselves, that are disputed by the empirical facts of hum 
behavior. 


TC 5 
› the fabled reflexive conditioning in ти 
is largely a myth. Conditioning is simply a descriptive term for о 
through paired experiences, not an explanation of how the changes ¢ 


ated ; n 
about. Originally, conditioning was assumed to occur automatically. O 
closer examination it turned out to be 


learn despite repetitive paired expe 
events are correlated (Dawson & Е 
conditioned reactions are lar, 


unaffected by events that are frighten- 


Ч, or pleasurable is to те uire that they 
be less than human. Although negative effects such аз and and dislikes 


can arise from paired experiences ofa direct or sicarios Bor: ва do кае 
of the ennobling qualities of man e pejorative о Yearning 


principles, which appear with regularity in Professional and lay publica- 
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ing, humiliating, disgusting, sa 


es -cience of psycholo 
offensive rhetoric is designed to swa 


It is well documented that behavior ig į ; 
much of the time. The image of man that па она N its he ин а 
on the types of consequences that are acknowled р т. connotes neers 
standing of how they operate, In theories that recognize’ у s the sd of 
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But external consequences, influentia; 
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‘en consequences serve several functions. First, they impart 
e Run By observing the effects of their actions, individuals even- 
я к "pide which behaviors are appropriate in which settings. The 
ibas ec information then serves as a guide for action. Contrary to the 
ЈЕ nanistic metaphors, outcomes change behavior in humans through 
е intervening influence of thought. 
n oc motivate through their incentive value, as well as in- 
es - By representing foreseeable outcomes symbolically, future con- 
i cius can be converted into current motivators of behavior. Many 
fut е things we do are designed to gain anticipated benefits and to avert 
na ure trouble. Our choices of action are largely under anticipatory con- 
ol. The widely accepted dictum that man is ruled by response conse- 
quences thus fares better for anticipated than for actual consequences. 
Consider behavior on a fixed-ratio schedule (say, 50:1) in which only 
лен fiftieth response is reinforced. Since 96% of the outcomes are ex- 
Inetive and only 4% are reinforcing, behavior is maintained despite its 
dissuading consequences. Аз people are exposed to variations in fre- 


quency and predictability of reinforcement, they behave on the basis of 
the outcomes they expect to prevail on future occasions. When belief 
behavior is weakly con- 


differs from actuality, which is not uncommon, 
trolled by its actual consequences until repeated experience instills realis- 
tic expectations (Bandura, 1971b; Kaufman, Baron, & Kopp, 1966). 
Had humans been ruled solely by instant consequences, they would 
have long become museum pieces among the extinct species. Not that our 
future is unquestionably secure. The immediate rewards of consumptive 
life-styles vigorously promoted for short-term profit jeopardize man’s 


long-term chances of survival. But immediate consequences, unless un- 
ich deferred ones (Mischel, 


usually powerful, do not necessarily outwelg 

1974). Our descendants shall continue to have a future only because those 
who foresee the aversive long-term consequences of current practices 
Mobilize public support for contingencies that favor survival behavior. 
Hazardous pesticides, for example, are usually banned before populations 
suffer maladies from toxic residues. The information-processing capaci- 
ties with which humans are endowed provide the basis for insightful 
behavior. Their capacity to bring remote consequences to bear on current 
behavior by anticipatory thought supports foresightful action. 


Explanations of reinforcement originally assumed that consequences 
ious involvement. The still prevalent 


increase behavior without consc 
notion that reinforcers can operate insidiously arouses fears that im- 
proved techniques of reinforcement will enable authorities to manipulate 
people without their knowledge or consent. Although the empirical issue 
is not yet completely resolved, there is little evidence that rewards 
function as automatic strengtheners of human conduct. Behavior is not 
much affected by its consequences without awareness of what is being 
reinforced (Bandura, 1969; Dulany, 1968). After individuals discern the 
instrumental relation between action and outcome, contingent rewards 


may produce accommodating or oppositional behavior depending on how 
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they value the incentives, the influencers, and the behavior itself, and 
how others respond. Thus reinforcement, as it has become better under- 
stood, has changed from a mechanical strengthener of conduct to an 
informative and motivating influence. 

People do not function in isolation. As social beings, they observe the 
conduct of others and the occasions on which it is rewarded, disregarded, 
or punished. They can therefore profit from observed consequences as 
well as from their own direct experiences (Bandura, 1971е). Acknowl- 
edgment of vicarious reinforcement introduces another human dimen- 
sion—namely, evaluative capacities—into the operation of reinforcement 
influences. People weigh consequences to themselves against those ac- 
cruing to others for similar behavior. The same outcome can thus become 


a reward or a punishment depending upon the referents used for social 
comparison. 


Human conduct is better explained by the relational influence of ob- 
served and direct consequence 


behavior is not fully predictable fr 


partly regulate their acti 


1971e; Thoresen & Maho consequences (Bandura, 
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respect for devalued conduct can be abated, however, by self-exonerating 
justifications. I shall return to this issue shortly. 

Another type of conflict between external and self-produced conse- 
quences arises when individuals are punished for behavior they regard 
highly. Principled dissenters and nonconformists often find themselves 
m this predicament. Personally valued conduct is expressed provided 
its costs are not too high. Should the threatened consequences be severe, 
one inhibits self-praiseworthy acts under high risk of penalty but readily 
Performs them when the chances of punishment are reduced. There are 
individuals, however, whose sense of self-worth is so strongly invested 
in certain convictions that they will submit to prolonged maltreatment 
rather than accede to what they regard as unjust or immoral. 

External consequences exert greatest influence on behavior when they 
are compatible with those that are self-produced. These conditions obtain 
when rewardable acts are a source of self-pride and punishable ones 
are self-censured. To enhance compatibility between personal and so- 
cial influences, people select associates who share similar standards of 
conduct and thus ensure social support for their own system of self- 
reinforcement. 

Individualistic theories of moral action assume that internalization 
of behavioral standards creates a permanent control mechanism within 
the person. Restraints of conscience thereafter operate as enduring con- 
trols over reprehensible conduct. The testimony of human behavior, 
however, contradicts this view. Much human maltreatment and suffering 
are, in fact, inflicted by otherwise decent moral people. And some of 
the most striking changes in moral conduct, as evidenced, for example, 
in political and military violence, are achieved without altering person- 
ality structures or moral standards. Personal control is clearly more 
complex and flexible than the theorizing implies. 

Although self-reinforcing influences serve as regu 
they can be dissociated from cens 
tices (Bandura, 1973). One device is 


ally and socially acceptable by defining i л | r 
People do not act in ways they ordinarily consider evil or destructive until 


Such activities are construed as serving moral purposes. Over the years 
much cruelty has been perpetrated in the name of religious principles, 
righteous ideologies, and regulatory sanctions. In the transactions of 
everyday life, euphemistie labeling serves аз а handy linguistie device 
for masking reprehensible activities or according them a respectable 
Status. Self-deplored conduct can also be made benign by contrasting 
it with more flagrant inhumanities. Moral justifications and palliative 
comparisons are especially effective because they not only eliminate 
self-generated deterrents but engage self-reward in the service of repre- 
hensible conduct. What was morally unacceptable becomes a source of 
self-pride. 

А common dissociative practice is to 0 
between one’s actions and the effects t 


lators of conduct, 
urable deeds by self-exonerating prac- 
to make inhumane behavior person- 
t in terms of high moral principle. 


bscure or distort the relationship 
hey cause. People will perform 
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behavior they normally repudiate if a legitimate authority sanctions it 
and acknowledges responsibility for its consequences. By displacing 
responsibility elsewhere, participants do not hold themselves accountable 
for what they do and are thus spared self-prohibiting reactions. Exemp- 
tion from self-censure can be facilitated additionally by diffusing respon- 
sibility for culpable behavior. Through division of labor, division of 
decision making, and collective action, people can contribute to detrimen- 
tal practices without feeling personal responsibility or self-disapproval. 

Attribution of blame to the victim is still another exonerative expedi- 
ent. Victims are faulted for bringing maltreatment on themselves, or 
extraordinary circumstances are invoked as justifications for ques- 
tionable conduct. One need not engage in self-reproof for committing 
acts prescribed by circumstances. A further means of weakening self- 
punishment is to dehumanize the victim. Inflicting harm upon people 
Who are regarded as subhuman or debased is less likely to arouse self- 
reproof than if they are looked upon as human beings with sensitivities. 

There are other self-disinhibiting maneuvers that operate by misrepre- 
senting the consequences of actions, As long as detrimental effects are 
ignored or minimized, there is little reason for self-censure. If conse- 
quences are not easily distortable, distress over conduct that conflicts 
n be reduced by selectively remember- 
he harm of one’s acts. 


esults in a i i 
manner. In many public and tius ,3n ethically questionable 
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Realization that reinforcement is an unarticulated way of designating 
appropriate conduct prompted the use of cognitive factors in the modifi- 
cation of behavior. Not surprisingly, people change more rapidly if told 
what behaviors are rewardable and punishable than if they have to dis- 
cover it from observing the consequences of their actions. Competencies 
that are not already within their repertoires can be developed with 
greater ease through the aid of instruction and modeling than by relying 
solely on the successes and failures of unguided performance. 

As further research revealed that reinforcers function as motivators, 
consequences were recognized as sources of motivation that depend 
heavily for their effectiveness upon the incentive preferences of those 
undergoing change. Hence, people do not indiscriminately absorb the 
influences that impinge upon them. Outcomes resulting from actions need 
not necessarily occur instantly. Humans can cognitively bridge delays 
between behavior and subsequent reinforcers without impairing the 
efficacy of incentive operations. 

At this second evolutionary stage, reinforcement practices changed 
from unilateral control to social contracting. Positive arrangements 
affirm that if individuals do certain things they are entitled to certain 
rewards and privileges. In the case of negative sanctions, reprehensible 
conduct carries punishment costs. The process is portrayed in reinforce- 
Ment terms, but the practice is that of social exchange. Most social inter- 
actions are, of course, governed by conditional agreements, though they 
usually are not couched in the language of reinforcement. Describing 
them differently does not change their nature, however. . 

Contingencies vary in the human qualities they embody and in the 
Voice individuals have in decisions concerning the social arrangements 
that affect their lives. Reflecting the salient values of our society, rem- 

Orcement practices have traditionally favored utilitarian forms of be- 
havior, But conditions are changing. With growing reservations about 
materialistic life-styles, reinforcement practices are being increasingly 
used to cultivate personal potentialities and humanistic qualities. These 
emerging changes in value commitments will probably accelerate as 
People devote fewer hours to working for income and have more leisure 
time for self-development. 


Another change of some consequence is e 
vidual rights. People are seeking a collaborative role in the development 


of societal contingencies that affect the course and quality of their lives, 
As part of this social trend, even the actions taken in the name of рзус P 
therapy are being examined for their ethies and social purposes. These 
Concerns have provided the impetus for prescripts to gnam e d 
Orcement techniques are used in the service of human betterment ra 


than as instruments of social control. | 
‚А ant mE issue is the relative attention devoted to changing 
individuals or to altering the institutions of society to enrich life. If psy- 
chologists are to have a significant impact on common problems of life, 
they must apply their corrective measures to detrimental societal prac- 


the renewed concern for indi- 
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tices rather than limit themselves to treating the casualties of these 
practices. This, of course, is easier said than done. Practitioners, what- 
ever their specialty, are reinforced more powerfully for using their 
knowledge and skills in the service of existing operations than for chang- 
ing them. Socially oriented efforts are hard to sustain under inadequate 
reinforcement supports. 

The methods of change discussed thus far draw heavily upon external 
consequences of action. Evidence that people can exercise some control 


over their own behavior provided the impetus for further changes in 
reinforcement practices, Int 


; and they serve as their own хере 
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M^ фын. responses and experiencing their effects. This legacy is 
eee much with us. The rudimentary form of learning based on direct 
но со псе has been exhaustively studied, whereas the more pervasive 
б р werful mode of learning by observation is largely ignored. A shift 
emphasis is needed. 

to Pie capacity to represent modeled 
бав new patterns of behavior 
i ment. From observing others, 1 
TA is performed, and on later occasions the coded information 
fr es as a guide for action. Indeed, research conducted within the 
cee of social learning theory shows that virtually all learning 
ан resulting from direct experience can occur on a vicarious 
де by observing other people’s behavior and its consequences for 
TA (Bandura, 1969). The abbreviation of the acquisition process 
кв observational learning is, of course, vital for both development 
survival. Modeling reduces the burden of time-consuming perfor- 
mance of inappropriate responses. Since errors can produce costly, if 
not fatal, consequences, the prospects of survival would be slim indeed 
if people had to rely solely on the effects of their actions to inform them 


about what to do. 

In many instances th 
essentially the same form. Driving а 
Surgery, for example, permit little, 
practices. In addition to transmitting particular response patterns, how- 
ever, modeling influences can create generative and innovative behavior. 
In the latter process, observers abstract common features from seem- 
ingly diverse responses and formulate generative rules of behavior that 


enable them to go beyond what they have seen or heard. By synthesizing 
ms, observers can achieve 


features of different models into new amalga 1 acl 
through modeling novel styles of thought and conduct. Once initiated, 
experiences with the new forms create further evolutionary changes. 


A partial departure from tradition eventually becomes a new direction. 
cribed to behavior theory are 


Some of the limitations commonly as 
based on the mistaken belief that modeling can produce at best mimicry 
of specific acts. This view is disputed by growing evidence that abstract 
modeling is a highly effective means of inducing rule-governed cognitive 
behavior (Bandura, 1971а; Zimmerman & Rosenthal, 1974). On the basis 
of observationally derived rules, people alter their judgmental orienta- 


tions, conceptual schemes, linguistic styles, information-processing 
е functioning. Nevertheless, 


strategies, as well as other forms of cognitivi 
faulty evaluations continue to be mistaken for weaknesses inherent in 
theory. 

Observational learning has recen 


activities symbolically enables man 
observationally without reinforced 
one forms an idea of how certain 


e behavior being modeled must be learned in 
utomobiles, skiing, and performing 
if any, departure from essential 


tly come to be accepted more widely, 
but some theorists are willing to grant it full scientific respectability only 
if it is reduced to performance terms. As a result, enactment paradigms 
are used which are rooted in the traditional assumption that responses 
must be performed before they can be learned. Instant reproduction of 
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modeled responses is favored, thereby minimizing dependence upon 
cognitive functions which play an especially influential role when reten- 
tion over time is required. The issue of whether reinforcement enhances 
modeling is pursued to the neglect of the more interesting question of 
whether one can keep people from learning what they have seen. 

When learning is investigated through observational paradigms, a 
broader range of determinants and intervening mechanisms gains promi- 
nence. Learning by observation is governed by four component pro- 
cesses: (a) attentional functions regulate sensory input and perception 
of modeled actions; (b) through coding and symbolic rehearsal, transitory 
experiences are transformed for memory representation into enduring 
performance guides; (c) motor reproduction processes govern the inte- 
gration of constituent acts into new response patterns; and (d) incentive 
or motivational processes determine whether observationally acquired 
responses will be performed. Studied from this perspective, observa- 


tional learning emerges as an actively judgmental and constructive 
rather than a mechanical copying process. 


Because observational learnin 


tional learning is developed Ъ 
vation, in memory encoding, 
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With growing evidence that cognition has causal influence in behavior, 
the arguments against cognitive determinants are losing their force. 

These recent developments have shifted emphasis from the study of 
response learning to analyses of memory and cognition. From this effort 
we have gained a better understanding of the mechanisms whereby 
information is acquired, stored, and retrieved. There is more to learning, 
however, than the acquisition and retention of information. Behavioristic 
theories addressed themselves to performance but deemphasized internal 
determinants, whereas the cognitive approaches remain immersed in 
thought but divorced from conduct. In a complete account of human 
behavior, internal processes must eventually be tied to action. Hence, 
explanations of how information eventuates in skilled performance must 
additionally be concerned with the organization and regulation of be- 
havior. Social learning includes within its framework both the processes 
internal to the organism as well as performance-related determinants. 

Speculations about man's nature inevitably raise the fundamental 
issues of determinism and human freedom. In examining these questions 
it is essential to distinguish between the metaphysical and the social 
aspects of freedom. Many of the heated disputes on this topic arise as 
much, if not more, from confusion over the dimensions of freedom being 
discussed as from disagreements over the doctrine of determinism. 

Let us first consider freedom in the social sense. Whether freedom is 
an illusion, as some writers maintain, or a social reality of considerable 
importance depends upon the meaning given to it. Within the social learn- 
ing framework, freedom is defined in terms of the number of options 
available to people and the right to exercise them. The more behavioral 
alternatives and social prerogatives people have, the greater is their 
freedom of action. 

Personal freedom can be limited in many different ways. Behavioral 
deficits restriet possible choices and otherwise curtail opportunities to 
realize one's preferences. Freedom can therefore be expanded by cul- 
tivating competencies. Self-restraints arising from unwarranted fears 
and stringent self-censure restrict the effective range of activities that 
individuals can engage in or even contemplate. Here freedom is restored 
by eliminating dysfunctional self-restraints. 

In maximizing freedom a society must place some limits on conduct 
because complete license for any individual is likely to encroach on the 
freedom of others. Societal prohibitions against behavior that is socially 
injurious create additional curbs on conduct. Conflicts often arise over 
behavioral restrictions when many members of society question conven- 
tional customs and when legal sanctions are used more to enforce a 
particular brand of morality than to prohibit socially detrimental conduct. 

The issue of whether individuals should be allowed to engage in activi- 
ties that are self-injurious but not detrimental to society has been de- 
bated vigorously over the years. Prohibitionists argue that it is difficult 
for a person, other than a recluse, to impair himself without inflicting 
secondary harm on others. Should self-injury produce incapacities, so- 
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ciety usually ends up bearing the treatment and subsistence costs. Liber- 
tarians do not find such arguments sufficiently convincing to justify a 
specific prohibition because some of the self-injurious activities that 
society approves may be as bad or worse than those it outlaws. Norma- 
tive changes over time regarding private conduct tend to favor an indi- 
vidualistic ethic. Consequently, many activities that were formerly 
prohibited by law have been exempted from legal sanctions. р 

Some groups have their freedom curtailed by socially condoned dis- 
crimination. Here, the alternatives available to a person are limited by 
skin color, sex, religion, ethnic background, or social class, regardless 
of capabilities. When self-determination is prejudicially restricted, those 
who are subordinated remove inequities by altering practices that com- 
promise or temporize the professed values of society. 

Freedom deals with rights as well as options and behavioral restraints. 
Man’s struggle for freedom is principally aimed at structuring societal 
contingencies so that certain forms of behavior are exempted from aver- 
sive control. After protective laws are built into the system, there are 
certain things that a society may not do to an individual, however much 
it might like to. Legal prohibitions on societal control create freedoms 
that are realities, not simply feelings or states of mind. Societies differ 
in their institutions of freedom and in the number and types of behaviors 
that are officially exempted from punitive control. Social systems that 
protect journalists from punitive control, for example, are freer than 
those that allow authoritative power to be used to silence critics or their 
vehicles of expression. Societies that possess an independent judiciary 
ensure greater social freedom than those that do not. 

In philosophical discourses, freedom is often considered antithetical 
to determinism. When defined in terms of options and rights, there is no 
incompatibility of freedom annd determinism. From this perspective, 
freedom is not conceived negatively as the absence of influences or simply 
the lack of external constraints. Rather, it is defined positively in terms 
of the skills at one’s command and the exercise of self-in 
which choice of action depends. 

Psychological analyses of freedom eventually lead to discourses on the 
metaphysics of determinism. Are people partial determiners of their 
own behavior, or are they ruled exclusively by forces beyond their соп- 
trol? The long-standing debate over this issue has been enlivened by 
Skinner’s (1971) contention that, apart from genetic contributions, human 
behavior is controlled solely by environmental contingencies; for exam- 
ple, “A person does not act upon the world, the world acts upon him” 
[р. 211]. A major problem with this type of analysis is that it depicts the 
environment as an autonomous force that automatically shapes and con- 
trols behavior. Environments have causes as do behaviors. For the most 
part the environment is only a potentiality until actualized and fashioned 
by appropriate actions. Books do not influence people unless someone 
writes them and others select and read them. Rewards and punishments 
remain in abeyance until prompted by appropriate performances. 


fluence upon 
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It is true that behavior is regulated by its contingencies, but the con- 
tingencies are partly of a person’s own making. By their actions, people 
play an active role in producing the reinforcing contingencies that im- 
pinge upon them. Thus behavior partly creates the environment, and 
the environment influences the behavior in a reciprocal fashion. To the 
oft-repeated dictum, change contingencies and you change behavior, 
should be added the reciprocal side, change behavior and you change 
the contingencies. 

The image of man’s efficacy that emerges from psychological research 
depends upon which aspect of the reciprocal control system one selects 
for analysis. In the paradigm favoring environmental control, investi- 
gators analyze how environmental contingencies change behavior [B = 
f(E)]. The personal control paradigm, on the other hand, examines how 
behavior determines the environment [Е = f(B)]. Behavior is the effect 
in the former case, and the cause in the latter. Although the reciprocal 
sources of influence are separable for experimental purposes, in every- 
day life two-way control operates concurrently. In ongoing interchanges, 
one and the same event can thus be a stimulus, a response, or an envi- 
ronmental reinforcer depending upon the place in the sequence at which 
the analysis arbitrarily begins. 

A survey of the literature on reinforcement confirms the extent to 
which we have become captives of a one-sided paradigm to map a bidirec- 
tional process. Environmental control is overstudied, whereas personal 
control has been relatively neglected. To cite but one example, there 
exist countless demonstrations of how behavior varies under different 
schedules of reinforcement, but one looks in vain for studies of how peo- 
ple, either individually or by collective action, succeed in fashioning 
reinforcement schedules to their own liking. The dearth of research on 
personal control is not because people exert no influence on their environ- 
ment or because such efforts are without effect. Quite the contrary. 
Behavior is one of the more influential determinants of future contingen- 
cies. As analyses of sequential interchanges reveal, aggressive individu- 
als actualize through their conduct a hostile environment, whereas those 
who display friendly responsiveness produce an amicable social milieu 
within the same setting (Rausch, 1965). We are all acquainted with 
problem-prone individuals who, through their aversive conduct, pre- 
dictably breed negative social climates wherever they go. 

It should be noted that some of the doctrines ascribing preeminent 
control to the environment are ultimately qualified by acknowledgment 
that man сап exercise some measure of countercontrol (Skinner, 1971). 
The notion of reciprocal interaction, however, goes considerably beyond 
the concept of countercontrol. Countercontrol portrays the environment 
as an instigating force to which individuals react. As we have already 
seen, people activate and create environments as well as rebut them. 

People may be considered partially free insofar as they can influence 
future conditions by managing their own behavior. Granted that selection 
of particular courses of action from available alternatives is itself deter- 
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mined, individuals can nevertheless exert some control over the factors 
that govern their choices. In philosophical analyses all events can be 
submitted to an infinite regression of causes. Such discussions usually 
emphasize how man’s actions are determined by prior conditions but 
neglect the reciprocal part of the process showing that the conditions 
themselves are partly determined by man’s prior actions. Applications 
of self-control practices demonstrate that people are able to regulate 
their own behavior in preferred directions by arranging environmental 
conditions most likely to elicit it and administering self-reinforeing con- 
sequences to sustain it. They may be told how to do it and initially be 
given some external support for their efforts, but self-produced influ- 
ences contribute significantly to future goal attainment. 

To contend, as environmental determinists often do, that people are 
controlled by external forces and then to advocate that they redesign 
their society by applying behavioral technology undermines the basic 
premise of the argument. If humans were in fact incapable of influencing 
their own actions, they could describe and predict environmental events 
but hardly exercise any intentional control over them. When it comes to 
advocacy of social change, however, thoroughgoing environmental de- 
terminists become ardent exponents of man’s power to transform en- 
vironments in pursuit of a better life. 

In backward causal analyses, conditions are usually portrayed as ruling 
man, whereas forward deterministic analyses of goal setting and attain- 
ment reveal how people can shape conditions for their purposes. Some 
are better at it than others. The greater their foresight, proficiency, and 
self-influence, all of which are acquirable skills, the greater the pro 
toward their goals. Because of the capacity for recipr 
people are at least partial architects of their own destinies. It is not de- 
terminism that is in dispute, but whether it is treated as a one-way or 
а two-way control process. Considering the interdependence of behavior 
and environmental conditions, determinism does not imply the fatalistic 
view that man is but a pawn of external influences. 

Psychological perspectives on determinism, like other aspects of the- 
orizing, influence the nature and scope of social practice. Environmental 
determinists are apt to use their methods primarily in the service of 
institutionally prescribed patterns of behavior. Personal determinists 
are more inclined to cultivate self-directing potentialities in man. The 
latter behavioral approach and humanism have much in common. Be- 
havioral theorists, however, recognize that "self-actualization" is by no 
means confined to human virtues. People have numerous potentialities 
that can be actualized for good or ill. Over the years, man has suffered 
considerably at the hands of self-actualized tyrants. A self-centered ethic 
of self-realization must therefore be tempered by concern for the social 
consequences of one's conduct. Behaviorists generally emphasize envi- 
ronmental sources of control, whereas humanists tend to restrict their 


interest to personal control. Social learning encompasses both aspects 
of the bidirectional influence process. 


gress 
ocal influence, 
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When the environment is regarded as an autonomous rather than as 
an influenceable determinant of behavior, valuation of dignifying human 
qualities and accomplishments is diminished. If inventiveness emanates 
from external circumstances, it is environments that should be credited 
for people’s achievements and chastised for their failings or inhumanities. 
Contrary to the unilateral view, human accomplishments result from 
reciprocal interaction of external circumstances with a host of personal 
determinants including endowed potentialities, acquired competencies, 
reflective thought, and a high level of self-initiative. 

Musical composers, for example, help to shape tastes by their creative 
efforts, and the public in turn supports their performances until advo- 
cates of new styles generate new public preferences. Each succeeding 
form of artistry results from a similar two-way influence process for 
which neither artisans nor circumstances deserve sole credit. 

Superior accomplishments, whatever the field, require considerable 
self-disciplined application. After individuals adopt evaluative standards, 
they expend large amounts of time, on their own, improving their per- 
formances to the point of self-satisfaction. At this level of functioning, 
persistence in an endeavor is extensively under self-reinforcement con- 
trol. Skills are perfected as much, or more, to please oneself as to please 
the public. 

Without self-generated influences, most innovative efforts would be 
difficult to sustain. This is because the unconventional is initially resisted 
and gradually accepted only as it proves functionally valuable or wins 
prestigious advocates. As a result, the early efforts of innovators bring 
rebuffs rather than rewards or recognition. In the history of creative 
endeavors, it is not uncommon for artists or composers to be scorned 
when they depart markedly from convention. Some gain recognition later 
in their careers. Others are sufficiently convinced of the worth of their 
work that they labor indefatigably even though their productions are 
negatively received during their lifetimes. Ideological and, to a lesser 
extent, technological advances follow similar courses. Most innovative 
endeavors receive occasional social support in early phases, but environ- 
mental conditions alone are not especially conducive to unconventional 
developments. 

The operation of reciprocal influence also has bearing on the public 
concern that advances in psychological knowledge will produce an in- 
crease in human manipulation and control. A common response to such 
apprehensions is that all behavior is inevitably controlled. Social influ- 
ence, therefore, is not a question of imposing controls where none existed 
before. This type of argument is valid in the sense that every act has 
a cause. But it is not the principle of causality that worries people. At 
the societal level their misgivings center on the distribution of controlling 
power, the means and purposes for which it is used, and the availability 
of mechanisms for exercising reciprocal control over institutional prac- 
tices. At the individual level, they are uneasy about the implications of 
psychotechnology in programming human relations. 
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Possible remedies for exploitative use of psychological techniques are 
usually discussed in terms of individual safeguards. Increased knowledge 
about modes of influence is prescribed as the best defense against manip- 
ulation. When people are informed about how behavior can be controlled, 
they tend to resist evident attempts at influence, thus making manipula- 
tion more difficult. Awareness alone, however, is a weak countervalence. 

Exploitation was successfully thwarted long before there existed a 
discipline of psychology to formulate principles and practices of behavior 
change. The most reliable source of opposition to manipulative control 
resides in the reciprocal consequences of human interactions. People 
resist being taken advantage of and will continue to do so in the future 
because compliant behavior produces unfavorable consequences for 
them. Sophisticated efforts at influence in no way reduce the aversive- 
ness of yielding that is personally disadvantageous. Because of reciprocal 
consequences, no one is able to manipulate others at will, and everyone 
experiences some feeling of powerlessness in getting what they want. 
This is true at all levels of functioning, individual and collective. Parents 
cannot get their children to follow all their wishes, while children feel 
constrained by their parents from doing what they desire. At univer- 
sities, the administrators, faculty, students, and alumni all feel that 
the other constituencies are unduly influential in promoting their self- 
interests but that one’s own group is granted insufficient power to alter 
the institutional practices. In the political arena, Congress feels that the 
executive branch possesses excessive power, and conversely the execu- 


tive branch feels thwarted in implementing its policies by congressional 
counteraction. 
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and industry but adversely affect the quality of life for other sectors of 
society. 

As the previous example illustrates, improved knowledge of how to 
influence behavior does not necessarily raise the level of social control. 
If anything, the recent years have witnessed a diffusion of power, cre- 
ating increased opportunities for reciprocal influence. This has enabled 
people to challenge social inequities, to effect changes in institutional 
practices, to counteract infringements on their rights, and to extend 
grievance procedures and due process of law to activities in social con- 
texts that hitherto operated under unilateral control. The fact that more 
people wield power does not in and of itself ensure a humane society. 
In the final analysis, the important consideration is the purposes that 
power serves, however it might be distributed. Nor does knowledge- 
ability about means of influence necessarily produce mechanical respon- 
siveness in personal relations. Whatever their orientations, people 
model, expound, and reinforce what they value. Behavior arising out 
of purpose and commitment is no less genuine than improvised action. 

The cliché of 1984 and its more recent kin diverts public attention from 
regulative influences that pose continual threats to human welfare. Most 
societies have instituted reciprocal systems that are protected by legal 
and social codes to prevent imperious control of human behavior. Al- 
though abuses of institutional power arise from time to time, it is not 
totalitarian rule that constitutes the impending peril. The hazards lie 
more in the intentional pursuit of personal gain, whether material or 
otherwise, than in control by coercion. Detrimental social practices arise 
and resist change, even within an open society, when many people benefit 
from them. To take a prevalent example, inequitable treatment of dis- 
advantaged groups for private gain enjoys public support without re- 
quiring despotic rule. 

Man, of course, has more to contend with than inhumanities toward 
one another. When the aversive consequences of otherwise rewarding 
life-styles are delayed and imperceptibly cumulative, people become 
willful agents of their own self-destruction. Thus if enough people benefit 
from activities that progressively degrade their environment, then 
barring contravening influences, they will eventually destroy their en- 
vironment. Although individuals contribute differentially to the problem, 
the harmful consequences are borne by all. With growing populations 
and spread of lavish life-styles taxing finite resources, people will have 
to learn to cope with new realities of human existence. 

Psychology cannot tell people how they ought to live their lives. It can, 
however, provide them with the means for effecting personal and social 
change. And it can aid them in making value choices by assessing the 
consequences of alternative life-styles and institutional arrangements. 
As a science concerned about the social consequences of its applications, 
psychology must also fulfill a broader obligation to society by bringing 
influence to bear on public policies to ensure that its findings are used 
in the service of human betterment. 
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5. VIOLENCE ОМ 
TELEVISION: 


How Harmful? 


It is one thing to suspect that violence on television produces more 
aggressive or violent behavior in people who view it, but it is quite an- 
other thing to prove it. The following article and letters indicate that 
even among professional psychologists there is no final consensus on the 
proper experimental procedures for testing this important hypothesis. 
Without question, the United States has a more violent society than we 
can tolerate. But in order to try to reduce the level of violence, it would 
be extremely helpful to understand its causes. On one hand, we were a 
violent society without television, and television violence certainly re- 
flects what people want to see (as the television producers like to tell 
us). On the other hand, since so many people watch television so much, 
television programming has an almost unlimited potential to shape atti- 
tudes and behavior. 

There is a vast literature on this subject, and the following articles can 
serve only as an introduction to some of the research and theory in this 
area. The particular value of the research of Leonard Eron, L. Rowell 
Huesmann, Monroe Lefkowitz, and Leopold Walder lies in the fact that 
they had, at the time of publication, more than 10 years’ worth of data 
on a large number of subjects through the subjects’ adolescence. These 
data describe both the subjects’ aggressive behavior and their television 
viewing habits. One of Eron et al.’s main findings was that there is a sig- 
nificant relationship between boys’ preference for violent television pro- 
grams in the third grade and their level of aggressiveness (as rated by 
their peers) 10 years later. Interestingly, no correlations were found be- 
tween viewing violence on television and aggressive behavior in females. 

The weakness of any correlational study, of course, is that there are 
usually several alternative explanations for a significant correlation. 
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The present research is a good example of this. What are the alternative 
hypotheses discussed by Eron et al.? Do their arguments on these alter- 
natives seem convincing to you? Why, or why not? In particular, consider 
the alternative hypothesis d in Figure 2—that aggression in the third 
grade (whatever its causes) contributes both to a preference for viewing 
violence on television and to later aggression. As you will see, the com- 
plete understanding of much of this sort of research depends on a clear 
understanding of correlation coefficients. 

One interesting feature of scientific literature is that it allows for public 
disagreement, or rebuttals concerning a previously published article. 
Many letters of rebuttal were submitted after publication of the Eron 
et al. report, and some of them are reprinted here to give you the flavor 
of the push-and-tug nature of scientific discourse. The letters by Gilbert 
Becker, Robert Kaplan, Dennis Howitt, and Herbert Kay make different 
points regarding the original study, and in the Huesmann et al. letter 
the original authors defend their original conclusions. As you read these 
letters, try to jot down the major arguments. Who do you think made the 
best points? On the basis of the following article and letters, what is your 
conclusion about the issue? Is more research needed, or just a better 
understanding of the existing data? How might one design an even better 
study for discovering the relationship between the viewing of violence 
on television and aggressive behavior? 
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DOES TELEVISION VIOLENCE 
CAUSE AGGRESSION? 


Leonard D. Eron, University of Illinois, Chicago Circle 


Monroe M. Lefkowitz, New York State Department 
of Mental Hygiene, Albany 


L. Rowell Huesmann, Yale University 


Leopold O. Walder, Organization for Research 
in the Behavioral Sciences, Greenbelt, Maryland 


With the increasing prominence of violence in our society, social scien- 
tists have been turning their attention to the antecedents of aggressive 
behavior in children and adults. Television programming with its heavy 
emphasis on interpersonal violence and acquisitive lawlessness has been 
assigned a role both in inciting aggression and teaching viewers specific 
techniques of aggressive behavior. The relation between overt aggres- 
sion and television habits has been demonstrated in a few survey studies 
which, however, because of the nature of surveys have not been able 
to discriminate cause and effect (Bailyn, 1959; Eron, 1963; Schramm, 
Lyle, & Parker, 1961). 

On the other hand, manipulative laboratory experiments have demon- 
strated an immediate effect on the extent of aggressive behavior of sub- 
jects who have witnessed aggressive displays on film (Bandura, Ross, 
& Ross, 1963a; Berkowitz, Corwin, & Heironimus, 1963). The latter 
studies, however, can be criticized for not duplicating real-life television 
viewing situations and possibly not accounting for anything more than 
a transient effect on the viewer. Hartup and Yonas (1971) stated in their 
review of developmental psychology in the latest Annual Review of 
Psychology, 

Current studies of the childhood determinants of aggression are not extensive. The 

report of a presidential commission, Violence and the Media (Baker & Ball, 1969), 

indicates that child psychologists possess much information concerning the deter- 

minants of aggression but very little stemming directly from naturalistic sources 

[р. 875]. 
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One possible way of utilizing survey procedures to demonstrate cause 
and effect is to use a longitudinal context. By contrasting the magnitude 
of contemporaneous and longitudinal correlations between two sets of 
variables, it is possible to account more clearly for which of the variables 
is antecedent and which consequent. The authors have now accumulated 
data on both aggressive behavior and television viewing habits over a 
10-year period in a large group of subjects first seen when they were 
8-9 years of age. Thus we can implement such an analysis. | 

The hypotheses of this research are that a young adult’s aggressive- 
ness is positively related to his preference for violent television when he 
was 8-9 years old and, furthermore, that his preference for violent 
television during this critical period is one cause of his aggressiveness. 


METHOD 


Longitudinal data were collected on 427 teenagers of an original group 
of 875 children who had participated in a study of third-grade children 
in 1960 (Eron, 1963; Eron, Walder, & Lefkowitz, 1971). The original 875 
constituted the entire third-grade population of a semirural county in 
New York’s Hudson River Valley, while the 427 subjects were those 
who could be located and interviewed 10 years later. 

The information collected about these subjects in both time periods 
falls into two classes: (a) measures of aggression and (b) potential pre- 
dictors of aggression. During the third-grade interviews, four different 
data sources had been used: the subject, his peers, his mother, and his 
father. Ten years later, the data sources were the subject and his peers. 
For convenience, this later time period will be designated as the thir- 
teenth grade. 

The variables used in the study are listed in Table 1. Two variables are 
of particular importance: peer-rated aggression and preference for vio- 
lent television programs. In the third grade, peer-rated aggression 
scores were obtained by asking each child to nominate any of his class- 
mates as 10 “guess who” items describing aggressive behavior; for ex- 
ample, “Who pushes and shoves other children?” “Who takes other 
children’s things without asking?” “Who starts a fight over nothing?” 
“Who says mean things?” These aggression items were interspersed 
among a series of other peer nomination questions. The validity and 
reliability of the aggression measure have been discussed elsewhere 
(Eron et al., 1971; Walder, Abelson, Eron, Banta, & Laulicht, 1961). 

The peer rating instrument was revised slightly for the thirteenth- 
grade study. One item which was deemed inappropriate for 19-year-old 
subjects was dropped. Furthermore, since the subjects were no longer 
in school, the procedure was administered individually in a face-to-face 
interview. The number of possible peers any subject could nominate 
was widened beyond his own third-grade classroom to include his class- 
mates through high school. Thus each subject was rated by a larger set 
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of raters in the thirteenth grade which included many of the raters from 
the third grade. 

In the third grade the children’s preferences for violent television were 
obtained by asking each mother for her child’s three favorite television 
programs. All programs mentioned were then categorized as violent 
or nonviolent by two independent raters with 94% agreement in their 
ratings. Differences in the remaining 6% of the programs were resolved 
by mutual discussion between the raters. Each subject received a score 
according to the number of violent television programs he was reported 
by his mother as favoring. Scores ranged from 1 (for no violent programs) 
to 4 (for three violent programs). 

In the 10-year follow-up study, each subject himself was asked for his 
four current favorite television programs. All programs were then cate- 
gorized for presence or absence of violence by two independent raters 
who were only a few years older than the subjects themselves. Scores 
were assigned to each program on the basis of agreement between the 
raters. If they both agreed a program was nonviolent, the program re- 
ceived a score of 0; if they both agreed it was violent, the score was 2; if 
they disagreed in categorization, the program was assigned a score of 1. 
Here again there was good agreement between the two raters. They 
agreed on 81% of 125 programs mentioned by the subjects. The score for 
each subject was the sum of the violence ratings of the four programs 
mentioned. 

The designation by these raters of violent and nonviolent programs 
agreed very well with the assignment of programs by Feshbach and 
Singer (1971) to aggressive and nonaggressive diets in their field experi- 
ment (see below). Furthermore, the judgments of our raters were in 
close agreement with the results obtained by Greenberg and Gordon 
(1970), who did an extensive rating study in which they used as raters 
both established television critics (approximately 45) and 300 subjects 
randomly selected from the Detroit, Michigan, telephone book. Of the 
20 programs which Greenberg and Gordon indicated had the highest 
violence ratings, 19 were selected as violent by our raters. For the 427 
cases in the 10-year follow-up study, there was a correlation of .94 be- 
tween the Greenberg-Gordon average ratings and our ratings. 


RESULTS 


The 427 subjects studied in both the third and thirteenth grades con- 
sisted of 211 males and 216 females whose modal age at the time of the 
thirteenth-grade interview was 19. The sample was at the higher end 
of the average range in intelligence with a mean IQ of 109 and was some- 
what middle class in social status. (The dropout rate from participation 
in the study between the third and thirteenth grade was considerably 
higher for subjects who displayed high aggression in the third grade than 
for those who displayed low aggression. For example, 57% of the low 
quartile in aggression were retained in the follow-up sample, while only 
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27% of the high quartile were retained. This can be explained by a num- 
ber of factors, including the finding that residential mobility was corre- 
lated significantly with aggression in the sample. | | 
Preliminary analysis indicated that the measures of aggression dis- 
tinguished the males from the females. There were statistically signifi- 
cant differences between males’ and females’ mean scores on every 
measure of aggression in both grades. The differences were more pro- 
nounced in the thirteenth grade. In addition, a principal component 
factor analysis of subjects’ sex and the variables that were used in the 
study yielded a first principal factor whose largest loading was for the 
subjects’ sex and whose next largest loadings were for the measures 
of aggression. Finally, a comparison of girls in the highest quartile 
revealed a difference in MMPI profiles; the high-aggressive girls were 
significantly more masculine in their interests and attitudes. Because of 
these findings, the data for males and females were analyzed separately. 


Correlations between Preference for Television, 
Violence and Aggression 


The correlations between 
gression at both grade levels 
in Table 2. 


preference for television violence and ag- 
are presented separately for boys and girls 


Table 2 
Correlations among Violence Ratings of Prefered Television Programs 
and Measures of Aggression at Two Different Periods 


TVVL3 | Accs | TVvLI3 | AGGis | AsB13 | MMPI -49513 
Boys 
TVVL3 1 21 05 3“ 10 12 
AGG3 01 38 ла E 
TVVLI3 1 —.05 01 08 
AGGI3 1 .49* .39* 
ASB13 1 .46* 
ММРІ-49513 1 
Girls 
TVVL3 1 02 08 -13 -0 —.05 
AGG3 1 -.08 AT o 13 
TVVLI3 1 —05 0 al 
AGGIS 1 23 p 
ASB13 1 .45* 
ММРІ-49513 


Note.—The number 3 or 13 i 

teenth grade, when the data were obtained. ТУУ: 
violence. AGG = peer-rated aggression; ASB 
havior; MMPI-498 = sum of T scores on Scales 4 and 9 of MMPI. 


* Indicates correlation is significantly different from zero beyond .01 level of confidence. 
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These correlations must be interpreted in light of the total number of 
correlations being computed for the sample of 427 subjects and the dis- 
tribution of the variables. Because of the large number of potential cor- 
relations, only those with a probability less than .01 on a two-tailed test 
were considered significant and only those with a probability less than 
.001 on a two-tailed test were called highly significant. On the other 
hand, it should be pointed out that the unusual skewness and kurtosis 
of many of the variables may impose a limit on the size of their inter- 
correlations. 

From Table 2, one can see that there is a highly significant relation 
between boys’ preferences for violent television programs in the third 
grade (TVVL3) and their peer-rated aggression in the thirteenth grade 
(АСС13). Similarly, there is a significant contemporaneous relation be- 
tween the boys’ television preferences in the third grade and peer-rated 
aggression in the third grade (AGG3). Neither of these effects was appar- 
ent for females. While the correlation between third-grade preferences 
and thirteenth-grade peer-rated aggression explains only 10% of the 
variance in aggression, 10% is impressive when one considers the prob- 
able limitations on the size of the correlation imposed by the skewed 
distributions of the variable, the large number of variables affecting 
aggression, the comparatively small explanatory power of these other 
variables (see below), and the 10-year lag between measurement times. 
The extremely low likelihood of achieving such a correlation by chance 
is a good indicator of the strength of the relation between preference for 
violent television at age 8 years and peer-rated aggression at age 19. 

As has been pointed out previously (Eron et al., 1971), when distri- 
butions are unusual, an analysis of variance may, more clearly than a 
product-moment correlation, reveal the relation between the two vari- 
ables. Hence the male subjects were partitioned into low, medium, and 
high television violence groups, representing approximately the lowest 
10%, the middle 80%, and the upper 10%, respectively. The results of 
one-way analyses of variance on this independent variable with aggres- 
Sion measures as the dependent variables are shown in Table 3. Now 
опе can see that there is a highly significant relation between preference 
for violent television in the third grade and aggression in the thirteenth 
grade, whether the measure is one of peer-rated aggression or self- 
ratings of aggression (antisocial behavior and ММР1-495). 


Causal Analysis 


Having established that there exists a highly signifieant relationship 
between a preference for violent television in the third grade and ag- 
gressive habits in the thirteenth grade, one can consider the alternative 
causal explanations for this phenomenon. Of course, we cannot demon- 
strate that a particular hypothesis is true. One can only reject untenable 
hypotheses and present evidence on the plausibility of the remaining 
hypotheses. 
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Table 3 
Mean Aggression Scores as a Function of Television Violence Ratings 
of Programs Preferred by Boys in Third Grade 


AGG3 АСС13 ASB ММР!-495 

TVVL3 п 

M SD м SD M SD M SD 
Low 31 9.06 9.91 5.14 5.05 25.58 12.12 121.97 20.50 
Medium 139 11.19 11.54 8.4 9.88 22.06 12.69 122.50 19.33 
High 14 2100 13.79 16.46 13.01 30.86 14.23 135.86 18.14 
Total 184 11.58 11.75 8.27 986 23.33 12.90 123.42 19.67 
ps 5.43**** 6.82***** = 4.04%%% 3.63** 3.11* 


Note.—Abbreviations: AGG = peer-rated aggression; ASB 
antisocial behavior; MMPI-49S = sum of T scores on Scales 


? Because of heterogeneity of variance, a t test between the two most discrepant means 


for AGGI3 was performed. The t was conservatively evaluated by using degrees of freedom 
equal to the n for the smallest group, that is, 14. 


*p < .05. 

**p < .08, 
жр < .01. 
Mee < (05, 
жнр < .001. 


= self-rating of frequency of 
4 and 9 of MMPI. 


grade and thirteenth-grade aggression 
i a relation between 


- The probability of a 
oss-lagged correlations 


А : for violent television 
in the third grade sti 


> п, and this aggression 
leads to thirteenth. Ў being тела каља f 
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Preference for Preference for 
violent TV in __________ 95 ———-———— Violent TV in 
the third grade the 13th grade 
(TVVL3) "~ (TVVL13) 
o | 
.21 —.05 
.31 


Aggression in Aggression in 
the third grade ——— 38 ——————- the 18th grade 
(AGG3) (AGG13) 
FIGURE 1. The correlations between a preference for violent television and peer-rated 
aggression for 211 boys over a 10-year lag. 


TVVL3 TVVL3 
„+ + |+ 
AGG3 AGGIS AGG3 = 
(а) |+ T WA 
А6613 Library “, 
(b) 
e 
AGG3 S 
| + Caleutta > 
TVVL3 AGG3 * B, & WO 
+ “+ +м “ыы ы 
AGGI3 AGGI3 TVVL3 
(c) (d) 
AGG3 
“+ -м 
AGGI3 TVVL13 


(e) 
FIGURE 2. Five feasible causal hypotheses for the correlations presented in Figure 1. 


gression, the correlation between early and later aggression would have 
been less than the product of the two intermediate correlations. It 
was not. 

One cannot reject so easily the more realistie alternative hypothesis 
diagramed in Figure 2d. This causal hypothesis asserts that early aggres- 
sion causes both contemporaneous preferences for violent television and 
later aggression. Part of this hypothesis, that early aggression contrib- 
utes to later aggression, is quite probably true. What is of interest here, 
though, is whether or not the relation between early television pref- 
erences and later aggression can be explained as an artifact of early 
aggression. One can obtain evidence to refute this idea by computing 
the partial correlation between third-grade television violence (TVVL3) 
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and thirteenth-grade aggression (AGGI3) while controlling wenn 
grade aggression (AGG3). If the hypothesis diagramed in Figure iae 
the complete explanation of the correlation between early te ps : 
preferences and later aggression, then the partial correlation would ч 
been zero. But it was not. It was .25 ога distance of only .06 below 2 
original correlation between a preference for television violence in t је 
third grade and thirteenth-grade aggression. Hence, the hypothesis 
diagramed in Figure 2d is implausible as a complete causal explanation. 
The final plausible alternative to be considered is that early aggression 
causes a diminished preference for violent television. This theory is 


diagramed in Figure 2е. If one views the cross-lagged correlations as 
deviations from the initial 


between early aggression а: 
that early aggression had 


ervative assumption of a very high temporal 
would be [(.21 + (—.05)) x 

М.70 x .70]/2 = .06. With this correction, 

ion di preference for violent television becomes untenable, 

lagged correlation from ea 


since the cross- rly aggression to later tele- 
vision preferences is very close to the base line. 


As Rozelle and Campbell (1969), Kenny (in press), and others have 
pointed out, the validity of cross-lagged 


grade synchronous correlations is indicative of lack of stationarity and 
necessitates consideration of Some artifactual effects, 


at the aggression 
between the third and thirteenth grade 


pattern. 
Similarly, a decrease in the reliabilit 


in the thirteenth grade could have caused the decrease in the correlation 
between third-grade aggression and thirteenth- 
over the synchronous correlation. i 


Y of the television violence ratings 
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liability. Furthermore, the reliability of thirteenth-grade television 
violence could not affect the cross-lagged correlation from television 
violence to aggression. Hence a decrease in the reliability of the tele- 
vision violence variable is not sufficient to explain the pattern of cross- 
lagged correlations. 

Basing the causality argument on the finding that the cross-lagged 
correlation is greater than the initial synchronous correlation raises 
another question. Was the increase significant? It is not appropriate to 
compare the cross-lagged correlation directly with the third-grade syn- 
chronous correlation since that correlation should erode over time. Using 
Kenny’s (in press) method for estimating erosion rates yields a rate of 
-605 for the correlation between television violence and aggression. Thus 
one should compare the cross-lagged correlation of .31 with .605 (.21) = 
.127. For N = 184, this comparison yields a Fisher's z of 2.33, which is 
significant at the .02 level. 

On the basis of the cross-lagged correlations and the significant partial 
correlation between early preferences for violent television and later 
aggression with early aggression controlled, one concludes that the 
single most plausible causal hypothesis is that a preference for watching 
violent television in the third grade contributes to the development of 
aggressive habits. This does not mean that other variables are not of 
equal or greater importance in stimulating aggression, but only that a 
preference for television violence and the viewing behavior that the 
preference indicates probably are independent and important causes of 
aggressive habits. 

Partial correlations. None of the data presented up to this point sheds 
evidence on the possibility that another variable such as IQ, social class, 
parental punishment, or parental aggression might have stimulated both 
the child’s preference for violent television and his aggressiveness. How- 
ever, one can test some of these hypotheses by computing the partial 
correlations between television violence and aggression with such vari- 
ables controlled. Table 4 contains these partial correlations. Neither the 
child’s aggression in the third grade, social class, mobility orientation of 
parent, IQ, parental punishment, parental aspirations for child, nor 
parental aggression accounts for the relationship. Nor can the relation 
be explained by the total number of hours of television watched by the 
subject in either the third or thirteenth grades. 

Multiple regression. As Darlington (1968) pointed out, one can treat 
the standardized coefficients in a multiple regression equation as mea- 
sures of the causal contributions of the predictor variables to the criterion 
variable. This approach assumes that all causal variables not in the re- 
gression equation are uncorrelated with those that are and that the cri- 
terion variable is not “causing” any predictor variable. While it is quite 
possible that these assumptions are violated, it is still worth while to 
examine the coefficients as approximate measures of causal contribu- 
tions. Hence a multiple regression equation was computed by a stepwise 
method that entered the third-grade variables into the equation in order 


a E 
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Table 4 
Partial Correlations between TVVL3 and AGGI13 with 
Other Variables Controlled 


PARTIAL CORRELATION 
CONTROLLED VARIABLE n BETWEEN TVVL3 
AND АСС13 
None 184 :81 
Third-grade variables 
Peer-rated aggression 184 25 
Father's occupational status 140 31 
Child's IQ 179 .28 
Father’s aggressiveness 140 .30 
Mother's aggressiveness 184 381 
Punishment administered 184 .31 
Parents’ aspirations for child 140 .30 
Parents’ mobility orientation 140 .81 
Hours of television watched 184 .30 
Thirteenth-grade variables 
Father's occupational status 182 .28 
Subject’s aspirations 149 .28 
Subject’s hours of television watching 183 .30 


Note.—Abbreviations: TVVL3 =a measure of preference for television violence in the 
third grade; AGG13 = peer-rated aggression in the thirteenth grade. 


£ the eriterion variable, thirteenth-grade 
quation shown in Table 5 predicts thirteenth- 
grade aggression solely from third-grade variables, 


variables were entered reveals that a pref- 


grade aggression among the third 
the hypothesis that a preference 


i : “grade television violence 
to thirteenth-grade aggression is the coefficient of third-grade television 
violence in a regression equation predicting thirteenth-grade aggression 
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Table 5 
Multiple Regression Prediction of Thirteenth-Grade 
Aggression (AGGI3) from Third-Grade Predictors 


=== 
FINAL 
STEP PREDICTOR MULTIPLE STANDARDIZED 
ENTERED! CORRELATION| COEFFICIENTS 

FOR PREDICTORS 
1 Television violence (TVVL3) .31 .29 
2 Parents’ mobility orientation 41 .28 
3 Child's identification with mother 44 –.08 
4 Parental nurturance of child 46 —.20 
5 Parental disharmony AT alt 
6 Parents’ aspirations for child 48 -.18 
7 Child's IQ 49 =й 

8 Mother's religiosity (frequency of 

church attendance) .50 .09 
9 Child's identification with father 51 12 
10 Ethnicity of family .51 —.09 
11 Parental rejection of child 52 .08 
12 Parental punishment of child 52 —.09 
13 Child's confessing to parents 52 -.07 
14 Father's aggressiveness .53 .06 
15 Child's hours of watching television 53 –.08 
16 Mother’s aggressiveness 53 —.08 
17 Father's occupational status 53 .02 


Third grade Thirteenth grade 


TV violence M TV violence 


.00 .24 
Aggression P am Би Aggression 


FIGURE 8: The path coefficients for a preference for watching violent television and peer- 
rated aggression. 


with third-grade aggression controlled. The obtained pattern of path 
Coefficients adds further credence to the argument that watching violent 
television contributes to the development of aggressive habits. 1 


DISCUSSION 


The above results indicate that television habits established by age 
8-9 years influence boys’ aggressive behavior at that time and at least 
through late adolescence. The more violent are the programs preferred 
by boys in the third grade, the more aggressive is their behavior both 
at that time and 10 years later. This relation between early television 
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habits and later aggression prevails both for peer-rated aggression and 
for self-ratings of aggression. Actually, these early television habits seem 
to be more influential than current viewing patterns since a preference 
for violent television in the thirteenth grade is not at all related to con- 
current aggressive behavior, nor are early television habits related to 
later television habits. : 

It would be very difficult to explain these results as methodological 
artifaets. While the peer-rated aggression measure possesses demon- 
Strated reliability (Walder et al, 1961), the thirteenth-grade scores 
represent more than the temporal reliability of the third-grade measure. 
Not only had 10 years of behavior intervened between the measurement 
periods, but the set of raters of each Subject was substantially different 
as well. It is also unlikely that the findings were produced by a common 
method rather than content. Method refers to the form of the measuring 
device and the procedure for obtaining and calculating the score; content 


refers to what one intends to measure. Feshbach (1970), for example, 
warned that the 


predictive utility of sociome! 


the substantial method variance yielded by each procedure, and it is evident that 


aggression is to the aggression criterion, the weaker 
are the relationships obtained [p. 181]. 


The peer-rated a 
be susceptible to such criticism as it ha: 


iolent television were measured in both 
it is reasonable to assume that a child's 
gram is very highly correlated with the 
to that program. The report of television 


: аз made by the mother, who probably was 
strongly influenced by her child’s actual viewing habits, especially since 
the question regarding televisi 
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oretical explanations of a violence effect. While minor variations between 
preferences and watching certainly must have occurred, it is difficult to 
see how such variations could have enhanced our results. 

These findings of a direct positive relation between the viewing of 
television violence and aggressive behavior on the part of the viewer 
corroborate in a long-term field study what has already been demon- 
strated as a short-term effect in the laboratory (Bandura et al., 1961, 
1963b; Berkowitz, 1964; Berkowitz & Rawlings, 1963). Because the latter 
are manipulative studies in which systematically varied treatments were 
administered to randomly selected subjects under controlled conditions, 
statements about cause and effect relations based on their findings can 
be made with more confidence perhaps than those based on findings 
of field studies. In such studies many uncontrolled variables are unac- 
counted for, and observation and measurement cannot be as precise. 
We feel that our findings relating television violence and aggressive 
behavior over a 10-year period strengthen the conviction that this is 
indeed a real relation. The causal direction indicated for the relation 
is that viewing violence regularly on television at age 8 leads to more 
aggressive behavior on the part of the viewer at that time and also in 
Subsequent years than does viewing nonviolent programs. Thus the 
Significance of the laboratory studies is extended to indicate that the 
influence of television violence is not confined to short-term effects. 

The laboratory studies of Bandura (Bandura, 1965; Bandura et al., 
1961, 1963a, 1963b) furnish a theoretical model to account for these re- 
sults. Bandura and his associates have demonstrated that aggressive 
behaviors new to the subject’s repertoire of responses, as well as those 
already well established in his repertoire, can be evoked by observation 
of models performing such aggressive behaviors. The fact that the model 
does not get punished for the aggressive behaviors he develops serves 
to lower whatever inhibitions subjects have against the expression of 
similar behaviors. The likelihood of performance of the observed aggres- 
sive behaviors is stronger when the model is rewarded for his aggressive 
behavior (Bandura, 1965). It is not unusual on the more violent television 
programs for central characters to obtain desirable goals by violent 
tactics. Continued exposure to such models probably strengthens the 
conviction that these behaviors are permissible, and observation of ac- 
quisition of desired goals by aggressive behaviors increases the proba- 
bility of performance of these behaviors by the observer. It is interesting 
in this regard that the more the subjects watch television at age 19 and 
the more violent the programs they prefer, the more likely are they to 
believe that situations depicted on television crime stories and westerns 


are realistic representations of life (r = .28 between judged realism of 


television programs and number of hours television is watched; and .86 
between judged realism and television violence). Thus subjects who 
watch television for many hours and prefer violent programs do not 
consider these aggressive behaviors as deviant but as appropriate ways 


of solving real-life problems. 
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The lack of an effect of television violence on girls also corroborates 
the Bandura laboratory studies, which have consistently found that boys 
perform significantly more imitative aggression than girls (Bandura et 
al., 1961, 1963b). Why this is so probably has to do with the differential 
socialization of boys and girls in regard to aggression. Very early in life, 
girls learn that physical aggression is an undesirable behavior for girls, 
and so they acquire other behaviors more suitable to expectations for 
girls. Since they do not learn physical aggression as a response to instiga- 
tion, they are rarely either rewarded or punished for such behavior and 
thus are not responsive to aggressive cues in the environment. Bandura 
(1965) has shown that when girls are positively reinforced for imitating 
aggressive behavior, they significantly increase such behavior and re- 
spond in a manner more similar to boys who are reinforced for the same 
behaviors. The results of Hokanson and Edelman (1966) would support 
this contention that lowered aggression levels in females are a function 
of lack of reinforcement for aggressive behavior. They found that females 


the female viewer is les 
imitate. 


f the current study with the somewhat 
ach and Singer (1971). In a bold attempt 
ry manipulation and a field setting, they 
indicate that the viewing of aggressive 
a diminution of aggressive behavior on 
al } ] and Singer regulated the amount of 
television aggression viewed by their Subjects over a six-week period 

nonaggressive diet of television pro- 
- The subjects were boys enrolled in 
es for the underprivileged. The sub- 
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jects watched television in groups of 10-18 boys at set times in specified 
locations. In two of the schools, participation by the subjects was com- 
pulsory. There was, of course, no voluntary choice of television pro- 
grams. There are indications that at least initially this lack of preference 
was resented by boys assigned to the nonviolent diet since it meant they 
could not watch their favorite programs. In a few cases the experiment- 
ers, in order to retain the cooperation of the subjects, permitted the boys 
in the nonaggressive group to watch one of the aggressive programs. 
Although all subjects were given the same cover story, some guessed 
the real purpose. Feshbach and Singer found that, of the boys around 
13 years old, those who were exposed to the aggressive television diet 
decreased in manifestations of aggression, while the control subjects 
increased in aggression. Feshbach and Singer’s limited and special sam- 
ple, their subject’s ages, and their methodology all introduce factors 
sufficient to explain the discrepancy between their results and ours. 
Actually the research of Feshbach and Singer addressed itself to a 
different question than did the present research, which is more con- 
cerned with the long-range effects of viewing television violence and 
with the consideration of more pervasive dispositions. In this regard, 
it is of interest that Feshbach and Singer used the peer rating measure 
which we had developed for our studies (Walder et al., 1961) in order 
to assess preexperimental aggression levels of their subjects and thereby 
separate them into high-aggressive and low-aggressive subsamples. 
They found: 
Regardless of experimental group, 
nomination measure display about twice as much 
as boys who score low on the peer nomination measure 
p. 91). 


boys who score high on the peer aggression 
aggressive behavior towards peers 
[Feshbach & Singer, 1971, 


Thus any manipulation that changes the peer nomination score would 
indeed have powerful effects, and we have demonstrated that it is this 
measure which is affected by television preference. While six weeks of 
violent television might be expected to stimulate more aggressive be- 
haviors, it could not be expected to have the effect on the development 
of aggressive habits that 5-10 years of viewing violence might have. 

The weight of evidence from this study when coupled with previous 
laboratory studies supports the theory that during a critical period in 
a boy’s development, regular viewing and liking of violent television lead 
to the formation of a more aggressive life style. 


SUMMARY AND CONCLUSIONS 

Am f a large-scale survey study of aggressive behavior 
in End ad piter had been the finding that children at that 
age who preferred violent television programs were more aggressive 
in school as rated by peers than children who preferred => vs E 
programs. In a 10-year follow-up study, 427 of the original 87 = а 
including 211 males and 216 females, were interviewed as to their tele- 
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vision habits. They again rated their peers on aggressive behavior = 
well as responding to questions concerned with variables not = ^ 
extensively in the current report. It was found that the violence у п 
grams preferred by the male subjects in Grade 3 was even more str = у 
related to aggression 10 years later. By the use of cross-lagged corre = 
tions, partial correlations, and multiple regression, it was demonstr ate 
that there is a probable causative influence of watching violent television 
programs in early formative years on later aggression. Of course, it is 
not claimed that television violence is the only cause of aggressive be- 
havior, since a number of other variables are also related to aggression, 
including ones in addition to the more than 20 variables we have reported 
on here, which could conceivably have had an influence over the 10 years 
covered by this longitudinal design. However, the effect of television 
violence on aggression is relatively independent of these other factors 
and explains a larger portion of the variance than does any other single 
factor which we studied, including IQ, social status, mobility aspirations, 
religious practice, ethnicity, and parental disharmony. 
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NOTES 


1. The authors are indebted to John М. Neale, State University of New York at Stony 
Brook, for this path analysis of our data and to David Kenny, Northwestern University, 
for suggestions regarding interpretation of the cross-lagged correlations. 


COMMENT: CAUSAL ANALYSIS IN R-R STUDIES: 
TELEVISION VIOLENCE AND AGGRESSION 


Gilbert Becker, University of Winnipeg 


Eron, Huesmann, Lefkowitz, and Walder (1972) presented data of high 
interest and of apparently high internal validity which strongly suggest 
that 8-9-year-old males, representing the particular population sampled, 
who prefer to watch television programs with high violence content tend 
to be peer-rated high on aggression not only at that age level (r = 21) 
but also 10 years later (r = .81). The stated hypotheses are (a) early 
preference for watching television violence “is positively related” to later 


Becker, Gilbert. Comment: Causal analyses in R-R studies: Television violence and 
aggression. American Psychologist, 1972, 27, 967-968. Copyright 1972 by the American 


Psychological Association. Reprinted by permission. 
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aggressiveness, and (b) this preference “is one cause of his се 
ness.” The authors conclude that (a) a positive correlation does exist an : 
(b) "it was demonstrated that there is a probable causative influence ~ 
watching violent television programs ... on later aggression.” In the 
section headed “Causal Analysis” the authors state that alternative 
causal explanations can be considered but that “one cannot demonstrate 
that a particular hypothesis is true. One can only reject untenable hy- 
potheses and present evidence on the plausibility of the remaining 
hypotheses [p. 257]." 

We agree with the immediately preceding statement but add that 


in a context in which one uses the terms “plausibility” and “probable 
causative influence” interchangeably, 


very important assumption, which is t 
each and across all cause-and. 


nalities by failure to consider at least one 
hypothesis from each is to raise 


applying a win-by-default logic. 
Before continuing, we must make с] 


the term “directionality.” In any R-R study, there are three and only 


three directionalities of cause and effect possible associated with the two 
correlated responses (R, and R 


Ды i У 2): (а) воте aspect(s) of the stimulus 
situation associated with or predicted by В, can cause R, (initial hy- 
pothesis); (b) some aspect(s) of the stimulus Situation associated with or 
predicted by R, can cause R,, provided that temporal considerations do 
not rule that Possibility out 


(reverse hypothesis); and (c) some aspect(s) 
of any one of a number of stimulus Situations, typically not represented 


within the investigation, can cause both R, and Е, (or some prototypical 


trace of В, or R,) simultaneously (common-cay, 1 is), These 
basic (independent) directionalities can p БАП hypothesis). 


е (a, 

of the stimulus situation associated with E. 2 и ле AT 
stimulus-mediating response in turn can bring about R ) and/or (b) com- 
bined (initial and reverse tendencies can wor reciprocal] oe fitis] 
and common-causal, or for that matter all three can work si D usly). 

In the Eron et al. study, elaborations аге created сан > ane the 
two responses of interest are measured at two di Tous i edis 
Those elaborations tend to mask the simplicity of the three.diy, i p та 
fact. It is not surprising to find that early aggression © E xcd as 
aggression. We would expect behaviors representing aaie ети, e 
modes of adjustment to show some consistency over time WE pes 
prising, to some degree, is the close to zero Correlation Between, early 
preference for viewing television violence and later Men о 
there is what we consider to be a one-sided elaborati се. 
hypotheses which include, for example, the se. 


ear what we mean when we use 


É 85 continuit 
aggression causing both later aggression and later prefer, 


= Ба епсе 1 
see as a combination of continuity and the reverse hyp (which we 


Othesis) АП five 
айы... № 
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of Eron et al.’s causal paradigms are variations of either the initial or 
the reverse hypothesis. Not considered at all are any one of a number 
of common-causal hypotheses. (We emphasize number because among 
the three basic directionalities, only the common-causal can have more 
than one causal agent.) Such a common-causal variable would not come 
from among the two pairs of responses (preference and aggression) since 
those must be dependent variables according to the common-causal para- 
digm. Thus we see an overabundance of initial and reverse hypotheses 
and an underabundance (meaning complete lack) of common-causal hy- 
potheses in which some factor other than one among the four response 
measures is the common-causal factor. A situation has thus been created 
wherein one can easily get the impression that because 2 possible causal 
hypotheses are considered and » — 1 of those are rejected, the remaining 
one must be (probably) correct. This is true especially when a vicarious- 
reinforcement model is invoked to show what mechanisms might under- 
lie the nonrejected hypothesis and especially when the n — 1 hypotheses, 
by virtue of their lack of independence, are numerous enough so that 
with each rejection the accumulated effect becomes such as to “force” 
a win-by-default conclusion. 

Let us switch from the abstract to the concrete and look at a single 
though complex possible directional paradigm that combines both an 
initial and a common-causal hypothesis. Suppose that one common-causal 
factor and/or another produces in at least some male children an unspeci- 
fied state or predisposition such that (a) in certain situations, aggressive 
behavior is more likely to occur and (b) the viewing of aggressive be- 
havior on television (and elsewhere) is more reinforcing and therefore 
more likely to occur. Thus far, the supposition, though lacking in detail, 
is sufficient to account for the т of .21 cited above. Suppose, however, 
that in addition, the high-“predisposition” boys who do watch more tele- 
vision violence become as a direct result even more aggressive over time, 
perhaps explainable in terms of the modeling effect cited by Eron et al. 
This would account for the .10 differential between early preference on 
the one hand and early versus later aggression on the other hand (which 
amounts to slightly over 5% “accounted-for” variance over the initially 
roughly 4%% found earlier in time). An alternative explanation for the 
differential, namely, that positively reinforced early aggression can lead 
to an increase in subsequent aggression, is equally tenable. An additional 
alternative hypothesis, following Feshbach and Singer (1971), cited by 
Eron et al., is that television viewing has not only a short-term effect 
in the opposite direction but also a long-term effect in the opposite direc- 
tion, which in connection with the first alternative hypothesis suggests 
the possibility that if television viewing were absent altogether, the r 
of .31 might be substantially higher. 

We conclude from our analysis that Eron et al.’s “demonstration” of a 
“probable causative influence” is no demonstration at all and that, there- 
fore, Conclusion 6 cited above is gratuitous. Further, we conclude that 
the positive correlation supporting Conclusion a may well be masking, 
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in terms of the “probable” initial hypothesis, a negative causal relation- 
ship. In a more general reference frame we add that causal analyses in 
R-R studies, although potentially fruitful, can easily lead to a spuriously 
high level of confidence in a given hypothesis and to a failure to appre- 
ciate the various combinations of cause-and-effect directionality that 
may be operating simultaneously in the extralaboratory world. This raw 
situation becomes even more raw when the directions to be taken for 
practical application to the problems of society depend on the outcome of 
such causal analyses. 


REFERENCES 
Eron, L. D., Huesmann, L. R., Lefkowitz, M. M., & Walder, L. 0. Does television vio- 
lence cause aggression? American Psychologist, 1972, 27, 253—263. 


Feshbach, S., & Singer, В. D. Television and aggression. San Francisco: Jossey- 


Bass, 
1971. 
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CAUSE OF AGGRESSION 
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ask, “Who started fights over nothing?” or “Who used to say mean 
things?” There are a number of reasons why this measure is inadequate 
for evaluating current aggression. First, a person may have had little 
or no contact with his former classmates. In the Eron et al. study, nearly 
half of the original subjects no longer lived in the area. It is quite likely 
that an individual was rated by peers he had not seen in years. A more 
important problem was that the structure of the questions made it impos- 
sible to make inferences about aggressive behavior at present. Respon- 
dents were asked about their peers’ previous behavior but not about their 
current behavior. Still another problem was that mothers’ reports were 
used to determine each child’s favorite programs in the third grade. 
These reports are clearly incompatible with self-report measures used to 
make the same assessments in the 10-year follow-up. Thus the results of 
the Eron et al. study should be interpreted with some caution. 


Robert D. Singer and I recently have completed a comprehensive 


review of the television and aggression literature. We have found that 


the majority of the experimental studies showed that witnessing violence 
can instigate “aggressive” behavior. These experiments, however, most 
frequently gained their effectiveness through the intentional arousal of 
subjects and the use of dependent measures that removed ordinary запс- 
tions against aggression. Instigation effects were rarely found in studies 
or experimental conditions in which the subjects were not aroused inten- 
tionally. When the measure of aggression has been some naturally occur- 
ring behavior, it has been shown that television violence either has no 
effect (Feshbach & Singer, 1971; Siegel, 1956) or only affects children 
who were initially highly aggressive (Stein, Friedrich, & Vondracek, 
1971). Thus we would argue that the link between televised violence 
and aggression has not been established clearly. 
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COMMENT: TELEVISION AND AGGRESSION: 
A COUNTERARGUMENT 


Dennis Howitt, University of Leicester, England 


Eron, Huesmann, Lefkowitz, and Walder (1972) have offered evidence 
to suggest that television violence causes aggression. They repeat well- 
founded criticisms of the experimental research into the effects of ex- 
posure to television violence and argue that the longitudinal survey 
approach to the problem has several advantages, especially when pro- 
vision is made for causal explanations. The present note is intended to 
indicate some of the limitations of this particular piece of research against 
the backdrop of the research literature in this area, 
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Table 1 
Mass Media Exposure and Aggression Studies Summarized 
IMPORTANT 
SOURCE METHOD RELATIONSHIPS 
Cressey & Thrasher | Compared delinquents and Delinquents attend more 
(1933) normals on cinema attendance. | frequently. 
Pfuhl (1960) Examined relationship between | Movie attendance and exposure 
violent media exposure to “dangerous” comic books 
(cinema, books, radio, correlated with delinquency. 
television, and comic books) 
and self-reported delinquency. 
Halloran, Brown, & | Considered probationers and Probationers and controls did not 
Chaney (1971) controls in terms of their differ in amount of viewing. 
television usage. 
Hoult (1949) Compared the comic book Comic book exposure highest 
exposure of delinquents and in delinquents. 
nondelinquents. 


television violence (which is somewhat in line with Halloran, Brown, 
and Chaney’s 1971 finding that delinquents tended to prefer what might 
be termed exciting programs). Since aggressive programs tend to be 
very popular (witness the viewing figures of television programs) and 
since a heavy viewer must watch nearly all programs rather than dis- 
criminate, a ready explanation is available of why aggressive youngsters 


watch less television and have a tendency to have greater preferences for 


aggressive programs. Certainly, the research of Eron et al. is interest- 


ing, but in the total context of mass media research it seems unlikely 
that their one longitudinal study finally could answer the question of the 
effects of television violence on real-life aggression. 
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COMMENT: WEAKNESSES IN THE TELEVISION- 
CAUSES-AGGRESSION ANALYSIS BY ERON ET AL. 


Herbert Kay, Herbert Kay Research, Inc., 
Moniclair, New Jersey 


In asking, “Does Television Violence Cause Aggression?” Eron, Bae 
mann, Lefkowitz, and Walder (1972) concluded that it probably does: 


By the use of cross-lagged correlations, partial correlations, and multiple regression, 


it was demonstrated that there is a probable causative influence of watching violent 
television programs in early formative years on later aggression [р. 263]. 


But there are serious weaknesses in the report that the authors failed 


theory in general. 


Although both boys and girls were studied, the authors’ conclusion is 
supported by data only from the boys, not from the girls. They reported 
rs between (a) assumed preference for violent television shows in the 
third grade and (b) peer-rated aggressive behavior in the thirteenth 
grade of .31 for boys but —.13 for girls. Their brief explanation for this 
curious result seems illogical and internally contradictory. They said 
that it 


corroborates the Bandura 
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probably has to do with t 
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If they do not condone it, then exposure to “aggressive cues” such as 
television violence would not increase the production of aggressive be- 
havior, as Eron et al. themselves suggested. But this analysis robs tele- 
vision violence of any real causative role. 

‚ Moreover, their own data accord with this analysis. If viewing of TV 
violence were а cause of aggressive behavior, greater “socialization” of 
girls than of boys might have involved a lower incidence of aggression 
for girls but not a lower correlation between whatever aggression was 
observed and television viewing. Yet Eron et al. reported virtually zero 
correlations between television violence and both early and later aggres- 
Sion among girls. 

They pointed out that nonnormality of the distributions could have held 
down the size of various correlations. But they reported an even higher 
correlation between early and later aggression among girls (.47) than 
among boys (.38). So nonnormality does not explain why the girls’ aggres- 
Sion (both early and later) had roughly zero correlation with early tele- 
vision viewing. 

The only conclusion that seems tenable is that both the data and the 
socialization hypothesis presented by Eron et al. confute rather than sup- 
port the validity of their conclusion that early viewing of television vio- 
lence probably induces later aggression. 


There is a further internal discrepancy. As part of their socialization 


hypothesis, they alluded to “observation of models” performing aggres- 
Slve behavior. The context implied that the modeling is done only by 
television characters, but it is well accepted that parents are fully as 


important, if not more so, as models. 

The investigators reported that measur 
were in third grade) of father’s and mother’s aggressiveness, and pa- 
rental punishment or rejection of child, yielded multiple-correlation 
beta weights of magnitude .09—.03, and that their measurements of 
these variables failed to improve the multiple-correlation prediction of 
thirteenth-grade aggression by more than .02 correlation points @.е., 
from .51 to .БЗ). 

What is strange is not just 
tenable version of a modeling hypoth 
aggression in the home has no materia 
sion—a finding certainly not supported b; 
child" syndrome, for example. 

The most likely resolution of these paradoxes is that the validity of 
the parental-aggression data is faulty. Mother's and father's aggression 
was determined by the Walters-Zak test. But no validity data or source 


reference for that test were provided by the investigators. Nor could I 
find it listed in the Mental М еаѕитететіѕ Yearbook or Personality Tests 
then, it is of unknown valid- 


and Reviews, both by Buros. Theoretically, 
ity to us. But because most psychometric devices, reports Zedeck (1971, 
p. 295), still have predictive validities that rarely exceed .50, the statis- 


tical odds are that the data from the Walters-Zak test likewise are not 
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likely to have greater than .50 validity—which, of course, means 25% 
accuracy. 

The source of other aggression-oriented data on parent-child relation- 
ships was described as “Mother’s report” or “Parents’ report.” With no 
further explication, we must presume that the investigators relied on 
the parents to report such matters accurately—that the parents neither 


ast-ratings services 
doubt on the validity 


report in preparation) i 
which both the mother па G Y nun consc 
wed. Prior research indi- 


while the interviewer 
requested to go into another 
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child was being interviewed. The purpose was to minimize the danger of 
the mother and child contaminating each other's responses. 

When asked, program by program, whether or not the child watches 
that show “at least once in awhile,” the mothers’ answers agreed with 
their children’s answers only 62% of the time for the 22 shows. An incon- 
gruent answer was defined as a mother’s “don’t know” or a viewing asser- 
tion not supported by the child’s response; both types of responses reflect 
an inability of the mother to report the child's viewing. Because of the 
prior survey findings that led to the present study, as well as other data 
(e.g., the general unreliability of second-person “hearsay” reports), most 
of the incongruency was attributed to mother error rather than child 
error. 

Analysis by sex of child revealed a significant F ratio between the 
mother-boy and mother-girl variances (due, it seems likely, to deviant, 
lazy, blanket “don’t know” responses from a few girls’ mothers) but no 
significant differences between the means. The most defensible conclu- 
sion thus far is that even if mothers should prove slightly more accurate 
in the relative sense in reporting boys’ than girls’ viewing, this difference 
would seem minuscule compared with the absolute magnitude of report- 
ing error for children of either sex. 

Using this one study as a benchmark, it appears reasonable to con- 
clude that the error variance in mothers’ reports of the third-grade child’s 
viewing preferences was at least two or three times larger than the co- 
efficients of determination obtained by Eron et al. in relating early view- 
ing to later aggression among the boys (10%) and the girls (2%). The 


Possibility of a significant interaction between third-grade television 
bles is large enough to suggest great 


reporting error and other varia ig ge 
caution in interpreting the correlations between early “viewing” and 


later aggression. 

Still another problem is that programs apparently were categorized as 
being violent or nonviolent on the basis of manifest rather than latent 
content—that is, without determining whether the children would react 
to the programs as portraying realistic aggression. Himmelweit, Oppen- 
heim, and Vince (1960) concluded that, unlike crime thrillers, violence 
in Westerns is stylized, not realistic, and “it is likely that the majority 
of children can enjoy them without fear by the time they are about 75 
With the "violent" programs made up of a mixture of unrealistic violence 
and realistic violence, we have still another confounding varible which 
may have affected the findings. (The investigators did assert that the 
19-year-olds who watch a lot of crime and Westerns use those shows as 
realistic models for their own behavior—but the supporting data were 
not actually relevant.) 5 | 

In short, there were so many serious loopholes not mentioned in the 
report by Eron et al. that one is justified in rejecting as insufficiently 
supported their conclusion from this particular study that viewing of 
television violence when young probably causes later aggressive behavior. 
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With any socially relevant research project like the Surgeon General's 
study of television violence, there is always the danger that passions will 
dominate rational thought. The specters of exploitation of children or 
censorship of the media loom large in the back of many peoples' minds. 
Yet we have not dealt with such issues nor have we proposed any solu- 
tions, and we do not wish to reply to criticisms of this nature. Instead 
we shall stick to questions of scientific substance. 

The major conclusions of our article were that 

the weight of evidence from this study when coupled with previous laboratory 


studies supports the theory that during a critical period in a body's development, 
regular viewing and liking of violent television lead to the formation of a more 


aggressive life style [p. 262] 


and 
bable causative influence of watching violent 


it was demonstrated that there is a pro 
years on later aggression [p. 263]. 


television programs in early formative 
These conclusions were based on results obtained from applying a 
variety of specialized statistical techniques (e.g., cross-lagged panel 
correlations, path analysis) to data collected on 427 children when they 


were in the third grade and again one year after they had graduated from 


high school (called the thirteenth-grade period). The major findings, 
п, are presented in Figure 1 and 


relating television violence to aggressio: 

Table 1. Table 1 did not appear in the original article, but it shows clearly 
how post-high-school aggression increases as а function of television 
violence, regardless of the subject’s level of aggression in the third grade. 


THE WEIGHT OF EVIDENCE 


In evaluating our conclusions, it is important to note our qualifying 
phrases “when coupled with previous laboratory studies” and “probable 
causative influence.” Yet many critics who feel our conclusion is too 
strong respond as if it were the only study in this area. As Chaffee (1972) 
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Table 1 
Mean Aggression Scores for Boys in the Thirteenth Grade 
PREFERENCE FOR TELEVISION VIOLENCE 
IN THE THIRD GRADE 
LEVEL OF 
AGGRESSION IN High Low x 
THE THIRD 80% 1 
GRADE (28095) Medium (<17%) 
High (>75%) 165.8 116.9 60.6 114.4 
(N = 13) N = 27) (N = 5) (N = 45) 
Medium 103.3 89.5 58.7 83.8 
(N = 15) (N = 59) (М = 17) (N = 9) 
Low (<26%) 110.8 19.0 32.4 54.1 
N = 8) (У = 31) (М = 9) (N = 48) 
X 126.6 15.1 50.6 
(N = 36) (N = 117) (М = 31) 
Note. Test of television violence in the thi 


rd grade: F = 6.63, df = 2/175, p <.005. 
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Table 2 
Correlations between the Peer Ratings of Aggression and the 
Thirteenth-Grade Self-Ratings of Aggression for Boys 


THIRTEENTH-GRADE THIRD-GRADE THIRTEENTH-GRADE 
SELF-RATINGS PEER RATING PEER RATING 
OF AGGRESSION OF AGGRESSION OF AGGRESSION 
Antisocial behavior .444 .478 
Aggressive habits .159 .503 
Total aggressive environment .165 :508 
MMPI-49-sum .206 .391 


As Table 2 reveals, thirteenth-grade peer-rated aggression correlates 
much more highly with every thirteenth-grade self-report measure than 
does third-grade aggression. Hence the argument that the thirteenth- 
grade score is only a remeasure of third-grade score does not seem to 
hold much water. But Kaplan (1972) also argued that “it is quite likely 
that an individual was rated by peers he had not seen in years.” Here, 


Kaplan is simply wrong. Perhaps he misunderstood the design of the 


study. Only those 427 subjects who were reinterviewed in the thirteenth 


grade were rated or used as raters. All of these subjects still lived in the 
area. The great majority of them graduated from high school in the same 
class. Only a small number might not have been seen by the other sub- 
jects “in years.” Finally, since the third-grade ratings had been done 
only by children within the subject’s classroom, the thirteenth-grade 
ratings were performed by а quite different set of raters from the third 
grade, many of whom did not even know the subject at the earlier time. 


“VAGUE” METHODOLOGICAL PROBLEMS 

Kaplan’s Comment included the statement that “several of the studies’ 
design problems have been described in a recent report to the Surgeon 
General by the Surgeon General’s Scientific Advisory Committee on 
Television and Social Behavior.” If Kaplan would care to state a particu- 
lar problem, we would be happy to respond, but we can hardly answer 
such a vague statement as this. If one reads the critiques of the two 
consultants hired by the Surgeon General’s Committee to analyze our 
results (Kenny, 1972; Neale, 1972), one finds some legitimate criticisms 
but general agreement with our conclusions. However, Kaplan may be 
excused if he has been misled by Chapter 7 of the summary of the Sur- 
geon General’s report. The distortions in that chapter have been detailed 
elsewhere (Pastore, 1972), and it has been reported (Paisley, 1972) 
that the chapter was written almost entirely by a consultant for CBS 


Television. 
REPORTS OF TELEVISION BEHAVIOR 


‘Another criticism leveled by Kaplan and elaborated on by Kay (1972) 
concerns the fact that the mother’s reports were used to ascertain the 


VIOLENCE ON TELEVISION: HOW HARMFUL? 
176 


: Р ань 
child’s favorite television programs in the third grade, ше К 
own reports were used in the thirteenth m en sti dir 
learly incompatible with self-r neasure 
[mother's] reports are с » hom this incompatibility is 
used . . . in the 10-year follow-up.” To whom s de 
i the other hand, о 
d to be clear is unclear. Kay (1972), on th Е 
peri deal of argument to Support his contention that the о : 
es most of Kay's argument is base p 
one pilot study of his own that he has never published. In this еле vid 
he apparently gave a list of 22 programs to 40 mothers ац иена E 
m “at least once in a while." He repo: А 
5 answer agreed with her child’s on only 


preference for viewing TV 
eighth-thirteenth correlation was .21. The existence 
refutes the claim that the m 


unvalidated and unreferenced is not true. 


MALE-FEMALE DIFFERENCES 
Kay’ 


our assertion that “the frequency o 
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females shown on television is somewhat low [Eron et al., 1972, p. 261]," 
certainly lower than for males. Hence “the female viewer is less likely 
to encounter an aggressive model to imitate [pp. 261-262]." It is this 
lack of models, coupled with differential socialization, that mitigates the 
violence effect in girls. 

Kay's (1972) two remaining criticisms, which are respectively irrele- 
vant and untrue, seem to be (а) Himmelweit, Oppenheim, and Vince 
(1960) said "it is likely that the majority of children can enjoy them [vio- 
lent television Westerns] without fear [italics added] by the time they 
are about 7"; (b) if Eron et al. admit that parents have any influence on 
how television violence affects the child (which, of course, we do), then 
this “robs television of any real causative role" (which, of course, is 
untrue). 


THIRD VARIABLES 


Perhaps no other criticism has been directed more frequently against 
our study than that a “third” variable causes both aggression and a pref- 
erence for viewing of violent television. Becker’s (1972) Comment focused 
on this point. His central thesis was that we considered only hypotheses 
of the nature “viewing television violence causes aggression” or “aggres- 
sion causes viewing of television violence” and ignored the third possi- 
bility that some other variable caused both. We would be the first to 
admit that we have not covered the scope of all third-variable hypotheses 
but neither have we ignored all. The correlations between a preference 
for viewing violent television and aggression were shown, for example, 
to be mostly independent of a child’s aggression level in the third grade, 
socioeconomic status, IQ, parental punishment, parental aggressiveness, 
parental aspirations for child, parental mobility orientations, and hours 
of television watched by the child. 

Becker argued that the “n — 1" hypotheses we reject are *small" com- 
pared to the “number” (and by “number” he must mean infinity) of hy- 
potheses that exist. Since (n — 1) divided by infinity is zero, he would be 
right no matter how many we investigated (short of infinity); though 
exactly what this proves is unclear. If Becker would propose some real 
alternative theories, we would gladly consider them. Becker’s one at- 
tempt at a “concrete” alternative included no suggestion as to what third 
variable might be relevant and is untestable as stated. We find Becker's 
concluding criticism that our “demonstration was not a demonstration 
at all” not a conclusion at all but a non sequitur. It could more aptly have 
been applied to his own wishful conclusion that in our data "the positive 
correlation . . . may well be masking . . . а negative causal relationship 


between television violence and aggression. 


CONCLUSION 


In any longitudinal study, given the opp 
vestigators would probably change many 


ortunity to repeat it, the in- 
elements of the design. We 


178 VIOLENCE ON TELEVISION: HOW HARMFUL? 


certainly wish we had done many things differently. Nevertheless, we 
do not find any of the criticisms leveled against the study, either alone 
or in concert, sufficient for us to change our conclusions. Adding up the 
evidence, we find no reason to change our conclusion that the most plausi- 
ble hypothesis to explain our findings (and the other studies and experi- 


ments in this area) is that early exposure to television violence probably 
has a causal influence on aggression in males. 
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6. THE BATTERED CHILD: 


Who Is Really to Blame? 
What Can Be Done? 


ma disturbing kind of human aggression is child abuse. Surprisingly 
very à , psychologists have not studied child abuse or child abusers 
to brin uch. But in recent years considerable effort has been expended 
alise g this problem out into the open, to determine the extent of child 
a "mid to try to find some explanations for it. The following two 
David Ue e rather different points of view with regard to child abuse. 
атаб x believes that it is the "social rather than the psychological 
Jit n eristics of the parents” to which child abuse should be attributed. 
тЫ апа David Rigler, on the other hand, emphasize the very 
dova нед characteristics of the child-abusing parent that Gil plays 
and S ^ ich article do you feel makes the better case? Or are both Gil 
à onn netta and Rigler correct, but simply pointing to different aspects of 

plex problem involving both sociologieal and psychological causes? 


at are the practical implications of these two points of view? What do 
be some children who are 


ү think of Gil's comment that there may 
е likely than “normal” children to provoke abusive attacks against 


th 
emselves? How could one test such а hypothesis? 
reports, how serious do you think is the 


| mentions? Are middle-class child abusers 
could lower-class abusers? Why? How 
one find out? (It is, incidentally, much easier to ask these questions 

rying to devise answers may 


ВТуе you a feel for what it would be li 
wees and Rigler suggest that the best predictor о 

a child abuser is whether or not he or she was a 

for purposes of argument, 


ie а | that this is true, 
sh with the causal factors that Gil cites for child abuse? 


bused as a child. 
how would this 
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Spinetta and Rigler ask a key question: “Why is it that most deprived 
families do not engage in child abuse, though subject to the same есо- 
nomic and social stresses as those families who do abuse their children? 

Granting that most human behavior has many contributing causes, 
what is Spinetta and Rigler’s main point about the causes of child abuse? 


Can you think of possible means of identifying and preventing potential 
child abuse? 


PHYSICAL ABUSE OF CHILDREN: 
FINDINGS AND IMPLICATIONS 
OF A NATIONWIDE SURVEY 


David G. Gil, Florence Heller Graduate Schoo! 
for Advanced Studies in Social Welfare, 
Brandeis University 


One of the basic obstacles to the development of effective counter- 
measures against accidental injury is the fact that the intensity of public 
concern over a particular type of injury-producing episode bears little 
relationship to its prevalence or to the seriousness of its consequences. 
An aircraft accident that takes 40 lives receives national attention 
through the mass media, whereas the fa 
highways on th 


unnoticed. The entrapment of a handful of 


accident countermeasures, 


Gil, David G. Physical abuse of children: Findings and implications of a nationwide 
survey. Pediatrics, 1969, 44, 857-864. Reprinted with permission of the publisher. 
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childhood injury. Unfortunately, legislative and administrative routines, 
once established, tend to perpetuate themselves. It will be interesting 
to note whether, in the light of the epidemiological findings, the reporting 
requirements will be withdrawn. 

Another major contribution of this paper is its attribution of child 
abuse to the social rather than the psychological characteristics of the 
parents. Although the past decade has witnessed an increasing concern 
with poverty and the poor, virtually no attention has been devoted to the 
differential vulnerability to accidental injury of children in various socio- 
economic levels and subcultural groups. Our increased understanding of 
cultural deprivation and of lower-class life style gives us ample material 
for speculation about this differential, but such speculation is hazardous, 
and systematic empirical research must be undertaken. 

Children in our own society and in many others have always been 
subjected to a wide range of physical and nonphysical abuse by parents 
and other caretakers, including teachers and child-care personnel and not 
infrequently by society as a whole. 1 Abusive treatment of children seems 
to be inherent in the basic inequality of physical make-up and social status 
between adults and children and in culturally determined permissive or 
even encouraging attitudes toward the use of physical force as a legiti- 
mate means for rearing and socializing the young. | . 

, In the course of centuries, progress has been made in protecting the 
interests and well-being of children in comparison with earlier periods 
When parents tended to wield absolute power over their offspring s life 
and death and when children were frequently abandoned to die of expo- 
Sure, legally bartered and sold, mutilated to enhance their pity-neitine 
appeal as beggars, or thrashed by schoolmasters with ferrules, rods, 
and the cat-o'-nine tails.” Despite considerable social and legal progress 
In the protection of children, the problem of their physical abuse cl 
tinues to cause serious concern in present-day American society. г 
extent of this concern is reflected in the fact that since 1940 several 
hundred professional articles and books concerning this po 
Were published by roentgenologists, pediatricians, т е ir 
tors, social workers, social and behavioral scientists, and Erud 
umerous papers on this subject have been presented at pro nm 
conferences of social workers and physicians № recent ыза 
Mass media have provided ample coverage for this О геп Tas peeti 

Professional and public interest in physical abuse of chi eis ee 
Stimulated by observations of roentgenologists of е : di но ин 
Multiple fractures of the long bones in young children v ical studies 
tion with subdural hematomas." Subsequent n. e airis m 
dus cases by social айа an LT my Om: 

ildren's hospitals, clinics, and © ag i 
munities inane the country led to the suspicion and е << 
firmation that these unexplained injuries of children ee andl maps 
У their own parents and caretakers.5 7? As а ens bed as a fairly 
other studies, physical abuse of children came to be recogn 
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‚ Widespread phenomenon which often resulted in serious, irreversible 
damage to the physical well-being and emotional development of children 
and which at times was even the immediate cause of their death. 

The types of injuries inflicted upon children were found to range from 
minor, superficial bruises and cuts through burns, scaldings, fractures, 
and internal injuries, to intentional starvation, dismemberment, and 
severe injuries to the brain and central nervous System. The circum- 
stances under which the injuries were inflicted were equally varied and 
ranged from simple disciplinary measures through uncontrollable angry 
outbursts, often under the influence of alcohol, to premeditated mur- 


› mày occasionally be more likely to provoke abusive attacks 


ncreasing interest in the phenome- 
Sicians, educators, social workers, 
» aS Well as of the public at large, led during the 1960s to the 


that “few legislative proposals in the history of the United States have 
been So widely adopted in so little time."17 
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P rique (3) the professional groups that are required to make reports, 
) the authorities on the local and state level to whom reports are to be 
[oves (5) the rights and responsibilities of the persons making the report, 
а Г responsibilities of the authorities receiving the reports, (7) the 
ii shment of central registries for the state, and (8) the penalties for 
А Ing to report in accordance with the law. The differences between the 
bemus state laws concerning the foregoing items are described and 
a ulated for easy reference in the already cited publications of the U.S. 
Children’s Bureau.!5: 16 
go been mentioned that a secondary purpose of reporting legisla- 
P n is to provide a broad basis for the study of the reported phenomenon. 
rior to the enactment of reporting legislation it was possible to study 
relatively small local samples of abused children in hospitals, clinics, and 
| protective agencies. Once all the states had enacted reporting legis- 
ation it became feasible to design a nationwide epidemiologic study of 
all legally reported incidents of child abuse. Such a study was launched 
n 1965 by Brandeis University in cooperation with the U.S. Children's 
ureau under the direction of this writer.!? 19 The purpose of the study 
Was to obtain systematic information on the number of legally reported 
incidents of child abuse throughout the United States, the distribution 
patterns of these incidents, the characteristics of the children, families, 
and perpetrators involved in the incidents, the cireumstances surround- 
Ing the incidents, and the measures taken by health, welfare, and law- 
enforcement authorities in dealing with the reported incidents. All cases 
reported throughout 1967 were to be studied, and all cases reported 
during that year in 38 representative sample communities were to be 
Subjected to a more comprehensive investigation and analysis. Р 
Before examining some of ће findings of the nationwide survey, it 
should be noted that legally reported incidents may constitute only part 
of the total universe of child abuse incidents, the real size and charac- 
teristics of which are likely to remain unknown. Since abuse incidents 
tend to occur in the privacy of a family's home, not all of them do come to 
the attention of individuals or institutions who are required to report, and 
Not all incidents that are known to such persons and institutions are 
actually reported. It is possible also that the selection of incidents which 


are eventuall is subject to systematic bias. It is known, for 
4 тараеп = ч ivate practice are less ready than 


Instance th ide » 
‚ that hysicians in pri й 
eng ham ort incidents that come to their atten- 


~ 4 eprived families. Such factors " 
cedures and their effectiveness may also t 
| f limitation: 
=з з requirements and reporting rates. Hae 
findings, the survey must be ept mine 


Duri nda я 
"ring 1987 а total of 5,993 physically abused children W 


thro, Е 
под P local channels. The geographic distribution 0 
his ees oo imately 10 the: pope 


? 
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among the states. Reporting rates remained fairly stable throughout 
ar. | 
в more than half the children were boys. Their ages => a 
follows: about one-quarter, under 2 years of age; one-quarter, hater 10 
2 and 6 years; one-quarter, between 6 and 10 years; and one-quarter ES 
years or older. Two-thirds of all the children were white, and about eo 
third were non-white. However, in the metropolitan areas less than 40% 
of the children were white, uu А - 
Nearly 80% of the children displayed noticeable deviations in вос : 
and behavioral functioning during the year preceding the abuse incident е 
Among the school-age children, 79% were in grades appropriate for Ше 
age; 13% were in grades below their age level or in classes for retarde 
children; and about 3% had never attended school, Че" 
Approximately 60% of the children had been abused prior to the inci- 
dent reported in 1967, Nearly 30% of the 


out a father or father Substitute, and about 20% had a stepfather in the 
home. Only 46% 


lived with their own father, whereas 87% lived with 
their own mother. Over 10% of the children’s mo 
20% were Separat 


with a spouse, 

The educational level and the o 
very low. Only 52% of the father: 
time of the incident about 12% o 
about three times as high as th 


income of the families was very low. Nearly 38% earned less than $3,500 
per year, and about 52% earned 


Nearly 48% of the children were abused by their mothers, and over 
39% by their fathers, However, since nearly 30% of the children lived 
in fatherless homes, the involvement rate in abuse of fathers or father 
substitutes who were living in the children’s homes was actually higher 
than that of mothers. In over 17% the abuse Was committed by a male 
caretaker while the mother was temporarily absent from her home. 
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| Тће injuries of the children were rated “not serious” in over 53% of the 
ot In 37% they were rated serious without permanent damage. 
iris y 596 of the injuries were rated serious with permanent damage, 
under 4% were fatal. It is known, however, that the fatality rate is 
under-reported because fatalities are not specifically mentioned as re- 
portante in all but three states. Nearly 60% of the children did not require 
ospitalization following the incident, and 25% did not require any medi- 
ү treatment. Over 21% required hospitalization beyond 1 week. № early 
% were considered “battered children” in accordance with the condi- 


tions described by Dr. Kempe and his colleagues (Kempe et al., 1962). 


In over 21% of the incidents the perpetrators themselves initiated 
cident. In 36%, other members 


Se the children subsequent to the in 1 36! e 
16 е victim’s household initiated help. Schools initiated help in over 
% of the incidents, and in over 80% neighbors, visiting relatives, and 
others initiated help. 

Over 36% of the abused children were removed from their families 
Subsequent to the abuse, and in over 15% of the incidents siblings of the 
abused child were also removed. Courts were involved in 46% of the 
incidents; the police in 53%; and social agencies in 87%. Over 13% of the 
Perpetrators were convicted by courts, and over 7% were jailed. 

As to the circumstances surrounding the abuse incidents reported in 
the survey, it was found that 63% developed out of disciplinary action 
taken by a parent. In 34% it was found that the child was psychologically 
resented and rejected by the abusive parent. In nearly 25% the child was 
Considered atypical in his behavior. In over 11% the abuse developed out 
of a quarrel between the parents. In 31% the perpetrator considered 
himself to be an authoritarian person; in over 46% the perpetrator was 
Viewed by others as suffering marked emotional deviations, and in nearly 
5% the abuse involved sadistic components. ‘Abuse coincided with neglect 
In nearly 34%, and in 13% the perpetrator was intoxicated with alcohol 
at the time of the abuse. In 59%, perpetrators were subject to mounting 
Stress as due to life circumstances. | 

s in the foregoing crude classi- 
fication of circumstances of dd up to 100% 
Sinee many incidents fall into more than one class 

The findings of the nationwide survey reported : B | 
ceding nationwide survey of publie knowledge, attitudes, and opinions 
about physical child abuse in the United States suggest the 
conclusions. 21 

The physical abuse of children can be conceptual 
aS a multidimensional phenomenon rather than a uniform phenomenon 
With one set of causal factors. Its pasic level upon which all other factors 
are superimposed is the general culturally permitted use of a measure of 
Physical force in caretaker-child interaction and the related absence of 
clear-cut legal prohibitions and sanctions again 


of interpersonal violence. The next level is determine 
Tearing traditions and practices of different social class 


ualized schematically 
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and nationality groups, and the different attitudes of these groups toward 
physical force as an acceptable measure for the achievement of the child- 
rearing objectives. A third level is determined by environmental chance 
circumstances which may transform an "acceptable" disciplinary measure 
into an “unacceptable” outcome. A fourth level is a broad range of envi- 
ronmental stress factors which may weaken or even paralyze temporarily 
a person’s psychological mechanisms of self-control or impulse control. 
In this way such environmental stress factors may be indirectly con- 
ducive to the uninhibited discharge of aggressive and destructive impulse 
and desires toward children, who are perceived, for real or imaginary 
reasons, to be the causes of stress, The final level is various forms of 
physical, social, intellectual, and emotional deviance and pathology of 
caretakers and/or children in their care, as well as of entire family units 


reported through legal ch 
less “acceptable” discipli 
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ES selection of cases entering official reporting channels. If then, the 
,000 reported cases represent as a group the more severe segment of 
the physical abuse spectrum, it follows that in qualitative terms physical 
abuse is by and large not very serious as reflected by the data on the 
extent and types of injury suffered by the children in the study cohort. 
The classical “battered child syndrome" is a relatively infrequent occur- 
rence. Even if allowance is made for the gross under-reporting of fatali- 
ties, physical abuse cannot be considered a major killer and maimer of 
children. 

It must also be stressed from an epidemiologic perspective that the 
phenomenon was found to be concentrated among the poor; it seems to 
be one aspect of the style of life associated with the poverty syndrome. 
Although it is certainly valid to argue, оп the basis of much evidence, 
that the poor are more likely to be reported for anything they do or fail 
to do and that their over-representation among the reported cases of 
child abuse may be in part a function of this kind of reporting bias, it 
must not be overlooked, nevertheless, that life in poverty generates 
Many additional stressful experiences which, as pointed out above, are 
likely to become precipitating factors of child abuse. The poor are subject 
to the same psychological conditions which may cause violent behavior 
toward children as are the nonpoor; but, in addition to this, they are 
Subject to the special environmental stresses and strains associated with 
Socioeconomie deprivation. Moreover, they have fewer alternatives and 
escapes than the nonpoor for dealing with aggressive impulses toward 
their children. Finally, there is an additional factor, the tendency toward 
More direct, less inhibited expression and discharge of aggressive im- 
pulses, a tendency learned apparently through lower-class socialization, 
which differs in this respect from middle-class socialization. 

Of considerable interest are also findings concerning the troubled 
past history of many abused chil 


dren, their parents, and their abus- 
ers—the relatively high rates of deviance and pathology in areas of bio- 
Psychosocial functioning which was frequently noted during the period 
Preceding the actual abusive incident and the high rate of deviance in 
family structure which is reflected in a high proportion of female-headed 
households and households in which the biological father is absent. 
. Measures aimed at the prevention or the gradual reduction of the 
Incidence and prevalence of specified social phenomena can achieve thei 
Purpose when they are designed and executed in a manner which assur es 
intervention on the causal level. Applying this model of preventive inter- 
Vention to the phenomenon of physical abuse of children and proceeding 
on an as yet untested conceptualization of the etiology of the phenomenon 
Which was presented above, the following approaches can be suggested. 
Since the cultural attitude permitting the use of physical force in child 
rearing seems to constitute the common соге of all physical abuse of 
children in American society, systematic educational efforts aimed at (a) 
£radually changing this particular aspect of the prevailing child-rearing 
philosophy, and (b) developing clear-cut cultural prohibitions and legal 
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sanctions against the use of physical force as a means for rearing children 
are likely to produce the strongest possible reduction in the incidence 


mechanism. No doubt this is o 
of preventing violence toward children 


A comprehensive high-quality, neighbo 


Social security system or from general tax 


1 only to the treatment of acute and chronic 
illness but also to the promotion and assurance of maximum feasible 


r every citizen, 


| у abused children belong 
ту, and in such situations the welfare of the 
child and the family may i 
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and ther. A | 

the St тү rather than a punitive philosophy if they are to serve 

the wastes 2i Dd reduction of the general level of violence and 
g the general level of human well-being throughout our 


entire society. 
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The aim of this review is to bring together professional opinions of this 
decade on the psychological characteristics of the abusing parent in order 
to determine from the most commonly held opinions what generalizations 
can be induced and thus to lay the groundwork for systematic testing of 


hypotheses. 


DEFINITION 

What is child abuse? Kempe et al. (1962) limited their study to children 
who had received serious physical injury, in circumstances which indi- 
cated that the injury was caused willfully rather than by accident. They 
coined the term “battered child” to encompass their definition. Zalba 
(1966), after a brief review of definitions, likewise addressed himself 
primarily to those cases in which physical injury was willfully inflicted 
on a child by a parent or parent substitute. 

Because of the difficulty of pinpointing what is emotional or psychologi- 
cal or social neglect and abuse, and because of the extent of the literature 
on physical abuse alone, this review, following Kempe's and Zalba's 
lead, limits the term "child abuse" to the concept of physical injury to 
the child, willfully inflicted. The review omits studies of parents who 
neglect their children—emotionally, socially, or psychologically —and 


adults who sexually molest them. 


MEDICAL AND LEGAL HISTORY 
aspects of the problem of child 


Literature on the medical and legal 
abuse is extensive. The edited volume of Helfer and Kempe (1968) con- 
tains a general overview, as do the articles by Paulson and Blake (1967), 
Silver (1968), and Zalba (1966). Legal aspects are delineated in De Francis 
(1970), MeCoid (1965), and the various articles by Paulsen (1966a, 1966b, 
1967, 1968a, 1968b). Simons and Downs (1968) gave an overview of pat- 
terns, problems, and accomplishments of the child-abuse reporting laws. 
A thorough bibliography on child abuse was published by the United 
States Department of Health, Education and Welfare (1969). 

This review is not concerned with the medical and legal aspects of 
the problem and refers only to those articles that gave more than a pass- 
ing mention to the psychological and social determinants of parental 


abuse of children. 


REVIEW OF THE LITERATURE 
ect to the same general criti- 


Most of the studies of child abuse are subj 
cism. First, the studies that set out to test specific hypotheses are few. 
Many start and end as broad studies with relatively untested common- 
Sense assumptions. Second, in most studies in this area the researchers 
Used samples easily available from ready-at-hand local populations, and 


thus the samples were not truly representative. We shall have to rely 
9n the convergence of conclusions from various types of sampling to 
establish generalizations. Third, practically all of the research in child 
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abuse is ex post facto. What is left unanswered and still to be tested is 
whether one can determine prior to the onset of abuse which parents 
are most likely to abuse their children or whether high-risk groups can 
only be defined after at least one incident of abuse has occurred. 

In spite of these criticisms, the studies of child abuse do give general 
data that can furnish hypotheses for more rigorous research design and 


for a more differentiated approach to the question of why parents abuse 
their children, 


Demographic Characteristics 
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socioeconomically and racially matched groups on 18 personality ue 
ables. Melnick and Hurley found, among other things, a probable history 
own upbringing. 
Further support for the hypothesis that the abusing parent was once 
an abused or neglected child is found in Bleiberg (1965), Blue (1965), 
Corbett (1964), Curtis (1963), Easson and Steinhilber (1961), Fairburn 
and Hunt (1964), Fleming (1967), Green (1965), Harper (1963), Kempe 
et al. (1962), McHenry, Girdany, and Elmer (1963), Morris, Gould, and 
Matthews (1964), Nurse (1964), Paulson and Blake (1969), Silver, Dublin, 
and Lourie (1969b), and Wasserman (1967). 
In a summary statement, Gluckman (1968 
earlier observers, set up a 10-point differen 
main point, and the point of this section of 


), repeating the findings of 
tial diagnosis category. His 
the review, is that the child 
3 capacity to love is not inherent; it must be 
т development depends on love, tolerance, 
parents were raised without this love and 
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Finally, there are those authors who considered low qo ey 
prime factor in the etiology of child abuse (Fisher, 1958; Simpson, i. a 
1968), although this point is disputed in the findings of Sere 
(1966), Holter and Friedman (1968), Kempe et al. (1962), and 
pom а common motivational factor behind child abuse? Is ч ч 
only опе “type” of abusing parent? Realization that each of the abo 


The first major attempt at a typology was made by Merrill (1962). Be- 


n H 2 in 
cause Merrill’s typology is the most often quoted, it is summarized 1 
some detail, 
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their families. In most of these situations, the mothers were working, 
and the fathers stayed at home, caring for the children. Their frustra- 
tions led to swift and severe punishment, to angry, rigid discipline. 

Two further attempts at classification, Delsordo (1963) and Zalba 
(1967), with slight modifications, can be reduced to Merrill’s categories. 

The use of categories seems simple, unifying, and time saving. If fur- 
ther work can be done in refining the categories, validating them in field 
research, perhaps they or similar clusters shown to be empirically valid 
can be used as an aid in the determination of high-risk parents. 

In this section we have seen a conglomerate picture of parental moti- 
vational and personality variables, with one author’s attempt to cluster 
the characteristics into a workable unity. One basic fact of agreement 
emerges from the studies in this section. The authors feel that a general 
defect in character—from whatever source—is present in the abusing 
parent, allowing aggressive impulses to be expressed too freely. During 
times of additional stress and tension the impulses express themselves 


on the helpless child. 


CRITIQUE OF A SURVEY 


" Of the studies surveying the demographic characteristics of families 
in which child abuse has occurred, the most extensive in scope was the 
Gil (1968а, 19680, 1969). In 1969, Gil 


national survey undertaken by 
reported that the phenomenon of child abuse was highly concentrated 
among the socioeconomically deprived segments of the population. Con- 
cluding that “physical abuse is by and large not very serious as reflected 
by the data on the extent and types of injury suffered by the children 
in the study cohort [p. 862],” Gil placed his intervention strategy in the 
general betterment of society. For Gil the cultural attitude permitting 
the use of physical force in child rearing is the common core of all physical 
abuse of children in American society. Since he found the socioeconom- 
leally deprived relying more heavily on physical force in rearing chil- 
dren, he recommended systematic educational efforts aimed at gradually 
changing this cultural attitude and the establishment of clear-cut cultural 
Prohibitions against the use of physical force as a means of rearing chil- 
dren. He viewed this educational effort as likely to produce the strongest 
Possible reduction in the incidence and prevalence of physical abuse of 
children, | 
For Gil, child abuse is ultima: ult of chance environmental 
actors, While admitting to vario social, intellectual, 
and emotional deviance and pat and in ee a 
Units to which they belong, Gil stressed a global control of environmenta 
actors as the solution to the problem of child abuse. He suggested: (a) 
6) elimination of poverty from the midst of America’s «eur = 
ilability i i urces aime! = 
the availability їп every commu тт gend 


tion and alleviation of deviance and pathology; 
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prehensive family planning programs and liberalized legislation concern- 
ing medical abortions, to reduce the number of unwanted children; (d) 
family-life education and counseling programs for adolescents and adults 
in preparation for and after marriage, to be offered within the public 
school system; (е) a comprehensive, high-quality, neighborhood-based 
national health service, to promote and assure maximum feasible phys- 
ical and mental health for every citizen; (f) a range of social services 
geared to the reduction of environmental stresses on family life; and 
(9) а community-based system of social services geared to assisting 
families and children who cannot live together because of severe rela- 
tionship problems. Gil’s ultimate objective is “the reduction of the general 
level of violence, and the raising of the general level of human well-being 
throughout our entire society [р. 863]." 

While one must praise the 
and the ultimate objective of 
raising the general level of h 


in reducing the amount of 
à mains the problem, insoluble at the demo- 
graphic level, of Why some parents abuse their children, while others 


long-range value in the lessening of the prevalence of ch 
erally agreed upon and finds Support throughout the ] 
ever, most of the authors explicitly caution against considering abuse, 
as does Gil, as a function Solely of educational, occupational, economie, 
or social stresses. This point is made by Adelson (1961), Allen eb ai 
(1969), EImer (1967), Fontana (1968), Helfer and Pollock (1967), Holter 
and Friedman (1968), Kempe (1968), Kempe et al. (1962), Paulson and 
Blake (1967), Silver et al. (19692, 1969b), and Steele and Pollock (1968). 

The great majority of the authors cited in this review have pointed 
to psychological factors within the parents themselves as of prime im- 
portance in the etiology of child abuse. They see abuse as Stemming from 
a defect in character leading to a lack of inhibition In expressing frustra- 
tion and other impulsive behavior. Socioeconomie factors Sometimes 
place added stress on the basic weakness in personality structure, but 
these factors are not of themselves sufficient or necessary causes of 
abuse. 


ild abuse is gen- 
iterature. How- 
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CONCLUSIONS 


The purpose of this review has been to bring together the published 
professional opinions on the psychological characteristics of the abusing 
parent, in order to determine from the most commonly held opinions 
what generalizations can be induced and thus to lay the groundwork for 
more systematic testing of hypotheses. 

The psychologist, both as a specialist in the functioning of the human 
as an individual, and as a scientist trained in research methodology, is 
in a unique position to test the hypotheses raised by professionals in the 
fields of medicine and social work, in the study of the personality charac- 
teristics of the abusing parent. Ne 

Certainly, one would hope that research can eventually develop criteria 
to distinguish those inadequate parents who, with professional help, can 
meet the needs of their children, from those who cannot. We need even- 
tually to be able to identify the high-risk families prior to the onset of 
abuse, but should be satisfied for the time being if we can help deter- 
mine after the fact of abuse which families must receive the most atten- 
tion to assure the further safety of their child. 
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NOTES 


1. Gil's book reporting his national findings (Violence against children: Physical child abuse 
in the United States. Cambridge, Mass.: Harvard Univ. Press, 1970) appeared after the 


present review was accepted for publication. Although the book offers greater detail, the 
findings and conclusions are identical to those in the cited references. 


7. ON BEING SANE IN 
INSANE PLACES 


The following two articles discuss the question of mental illness, or 
insanity. The article by D. L. Rosenhan is a stunning (and frightening) 
report on what happened to eight pseudopatients who recently faked 
their way into psychiatric hospitals across the United States. Despite 
the fact that on entry they ceased to fake any symptoms of abnormality 
and were both cooperative and friendly, none were detected as fakes by 
any of the staff (although several patients recognized their deception). 
The effect of being labeled “schizophrenic” on entry was profound. It 
appears that as far as a hospital staff in concerned, “once crazy, always 
crazy.” This indictment of our mental institutions is crushing. Rosenhan’s 
data raise serious questions about the legitimacy of psychiatric care 
as it exists, even in our best mental hospitals. The powerlessness and 
depersonalization of inmates of mental institutions is tragic, as Rosenhan 
illustrates: “I have records of patients who were beaten by staff for the 
sin of having initiated verbal contact. During my own experience, for 
example, one patient was beaten in the presence of other patients for 
having approached an attendant and told him, ‘I like you.’” " 

Patients in mental hospitals become invisible as humans, à condition 
that can do little to help them recover from mental problems: “А nurse 
unbuttoned her uniform to adjust her brassiere in the presence of an 
entire ward of viewing men. One did not have the sense that she was 
being seductive. Rather, she didn't notice it. A group of staff persons 
might point to a patient in the dayroom and discuss him animatedly, as 
if he were not there." 

According to Rosenhan, “it is clear that we cannot distinguish the sane 
from the insane in psychiatric hospitals.” The hospital itself becomes an 
“insane place” in which a patient, once labeled “insane,” stands virtually 
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no chance of ever being treated as a whole human being again. Rosenhan 
seems to understate the case by saying that the “consequences to patients 
hospitalized in such an environment—the powerlessness, depersonali- 
zation, segregation, mortification, and self-labeling—seem undoubtedly 
countertherapeutic.” | 
The idea that we may not be able to distinguish sanity from insanity 
in “insane places” such as our mental hospitals does not necessarily mean 
that insanity does not exist (although, as Roger Brown points out, there 
are individuals who are proposing that). If there really is such a thing as 
schizophrenia, what are its outstanding characteristics? If psychosis 
doesn’t exist, for example, why are there antipsychotic drugs, such as 
the phenothiazines, which are consumed by literally millions of people 
around the world? | 
Brown develops the notion that the inevitable mark of schizophrenia 
is “delusion,” of which there are almost infinite variations. How can we 
reconcile Rosenhan's view that psychiatrists cannot distinguish the sane 
from the insane (although patients can) with Brown’s report that “naive” 
college undergraduates are quite able to make reliable judgments (com- 
pared with Ph.D. clinical psychologists) about the degree of “schizo- 
phrenicity” by examining short samples of language from a patient? 
There is, of course, the cynical answer: The years of medical and psy- 
chological training add nothing to one’s ability to deal with schizophrenia. 
But Brown prefers a less pessimistic explanation. What is it? Are you 
convinced by Brown’s argument that there is a shared cultural view 
about “what constitutes adequate reality testing and rational coherence” 
and that in our culture this allows “us, sane ones” to detect the insanity 
in the language production of “them, the insane ones”? If not, what are 
we to do with our concepts of insanity and, more important, individuals 
whose view of reality is so different from ours as to impair either their 


own safety or ours or both? Perhaps the answer will not be in the hands 


of psychologists anyway. As Brown suggests, the biochemists may be 
the first to develop “ 


a precise etiology and an effective therapy.” 


ON BEING SANE IN INSANE PLACES 


D. L. Rosenhan, Stanford University 


If sanity and insanity exist, how shall we know them? 
The question is neither capricious nor itself insane. However much 
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we may be personally convinced that we can tell the normal from the 
abnormal, the evidence is simply not compelling. It is commonplace, 
for example, to read about murder trials wherein eminent psychiatrists 
for the defense are contradicted by equally eminent psychiatrists for the 
prosecution on the matter of the defendant’s sanity. More generally, 
there are a great deal of conflicting data on the reliability, utility, and 
meaning of such terms as “sanity,” “insanity,” “mental illness,” and 
“schizophrenia” (1). Finally, as early as 1934, Benedict suggested that 
normality and abnormality are not universal (2). What is viewed as 
normal in one culture may be seen as quite aberrant in another. Thus 
notions of normality and abnormality may not be quite as accurate as 
people believe they are. 

To raise questions regarding normality and abnormality is in no way 
to question the fact that some behaviors are deviant or odd. Murder is 
deviant. So, too, are hallucinations. Nor does raising such questions 
deny the existence of the personal anguish that is often associated with 
“mental illness.” Anxiety and depression exist. Psychological suffering 
exists. But normality and abnormality, sanity and insanity, and the 
diagnoses that flow from them may be less substantive than many believe 
them to be. 

At its heart the question of whether the sane can be distinguished from 
the insane (and whether degrees of insanity can be distinguished from 
each other) is a simple matter: do the salient characteristics that lead 
to diagnoses reside in the patients themselves or in the environments and 
contexts in which observers find them? From Bleuler through Kretchmer 
through the formulators of the recently revised Diagnostic and Statisti- 
cal Manual of the American Psychiatric Association the belief has been 
strong that patients present symptoms, that those symptoms can be 
categorized, and implicitly, that the sane are distinguishable from the 
insane. More recently, however, this belief has been questioned. Based 
in part on theoretical and anthropological considerations but also on 
philosophical, legal, and therapeutic ones, the view has grown that psy- 
chological categorization of mental illness is useless at best and downright 
harmful, misleading, and pejorative at worst. Psychiatric diagnoses, in 
this view, are in the minds of the observers and are not valid summaries 
of characteristics displayed by the observed (3-5). 

Gains can be made in deciding which of these is more nearly accurate 
by getting normal people (that is, people who do not have and have never 
suffered symptoms of serious psychiatric disorders) admitted to psychi- 
atric hospitals and then determining whether they were discovered to 
be sane and, if so, how. If the sanity of such pseudopatients were always 
detected, there would be prima facie evidence that a sane individual can 
be distinguished from the insane context in which he is found. Normality 
(and presumably abnormality) is distinct enough that it can be recognized 
wherever it occurs, for it is carried within the person. If, on the other 
hand, the sanity of the pseudopatients were never discovered, serious 
difficulties would arise for those who support traditional modes of psy- 
chiatric diagnosis. Given that the hospital staff was not incompetent, 


205 ON BEING SANE IN INSANE PLACES 


that the pseudopatient had been behaving as sanely as he had been out- 
side of the hospital, and that it had never been previously suggested 
that he belonged in a psychiatric hospital, such an unlikely outcome would 
support the view that psychiatric diagnosis betrays little about the pa- 
tient but much about the environment in which an observer finds him. 
This article describes such an experiment. Eight sane people gained 
secret admission to 12 different hospitals (6). Their diagnostic experi- 
ences constitute the data of the first part of this article; the remainder is 
devoted to a description of their experiences in psychiatric institutions. 
Too few psychiatrists and psychologists, even those who have worked in 
such hospitals, know what the experience is like. They rarely talk about 
it with former patients, perhaps because they distrust information com- 
ing from the previously insane. Those who have worked in psychiatric 
hospitals are likely to have adapted so thoroughly to the settings that 
they are insensitive to the impact of that experience. And while there 
have been occasional reports of researchers who submitted themselves 
to psychiatric hospitalization (7), these researchers have commonly 
remained in the hospitals for short periods of time, often with the knowl- 
edge of the hospital staff. It is difficult to know the extent to which they 
were treated like patients or like research colleagues. Nevertheless, 
their reports about the inside of the psychiatric hospital have been valu- 
able. This article extends those efforts, 


PSEUDOPATIENTS AND THEIR SETTINGS 


The eight pseudopatients were a varied group. One was a psychology 
graduate student in his 20’s. The remaining seven were older and “es- 


tablished.” Among them were three psychologists, a pediatrician, a 
psychiatrist, a painter, and a housewife. Three pseudopatients were 
women; five were men. All of them employed psuedonyms, lest their 
alleged diagnoses embarrass them later. Those who were in mental 
health professions alleged another occupation in order to avoid the special 
attentions that might be accorded by staff, as a matter of courtesy or 
caution, to ailing colleagues (8). With the exception of myself (I was the 
first pseudopatient, and my presence was known to the hospital admin- 
istrator and chief psychologist and, so far as I can tell, to them alone), 
the presence of pseudopatients and the nature of the research program 
was not known to the hospital staffs (9). 

The settings were similarly varied. In order to generalize the findings, 


admission into a variety of hospitals was sought. The 12 hospitals in the 
sample were located in five different states on the East and West coasts. 
Some were old and shabby; 


Some were quite new. Some were research- 
oriented; others not. Some had good staff-patient ratios; others were 
quite understaffed. Only one was a strictly private hospital. All of the 


others were supported by state or federal funds or, in one instance, by 
university funds. 


After calling the hos 


pital for an appointment, the pseudopatient ar- 
rived at the admission: 


5 office complaining that he had been hearing 
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voices. Asked what the voices said, he replied that they were often un- 
clear, but as far as he could tell they said “empty,” “hollow,” and “thud.” 
The voices were unfamiliar and were of the same sex as the pseudo- 
patient. The choice of these symptoms was occasioned by their apparent 
similarity to existential symptoms. Such symptoms are alleged to arise 
from painful concerns about the perceived meaninglessness of one’s life. 
It is as if the hallucinating person were saying, “My life is empty and 
hollow.” The choice of these symptoms was also determined by the ab- 
sence of a single report of existential psychoses in the literature. 

Beyond alleging the symptoms and falsifying name, vocation, and 
employment, no further alterations of person, history, or circumstances 
were made. The significant events of the pseudopatient’s life history 
were presented as they had actually occurred. Relationships with par- 
ents and siblings, with spouse and children, with people at work and in 
School, consistent with the aforementioned exceptions, were described 
as they were or had been. Frustrations and upsets were described along 
with joys and satisfactions. These facts are important to remember. If 
anything, they strongly biased the subsequent results in favor of detect- 
ing sanity, since none of their histories or current behaviors were seri- 
ously pathological in any way. 

Immediately upon admission to the psychiatrie ward, the pseudo- 
patient ceased simulating any symptoms of abnormality. In some cases 
there was a brief period of mild nervousness and anxiety, since none 
of the pseudopatients really believed that they would be admitted so 
easily. Indeed, their shared fear was that they would be immediately 
exposed as frauds and greatly embarrassed. Moreover, many of them 
had never visited a psychiatrie ward; even those who had nevertheless 
had some genuine fears about what might happen to them. Their ner- 
vousness, then, was quite appropriate to the novelty of the hospital 
setting, and it abated rapidly. 

Apart from that short-lived nervousness, the pseudopatient behaved 
оп the ward as he "normally" behaved. The pseudopatient spoke to pa- 
tients and staff as he might ordinarily. Because there is uncommonly 
little to do on a psychiatrie ward, he attempted to engage others in con- 
versation. When asked by staff how he was feeling, he indicated that he 
was fine, that he no longer experienced symptoms. He responded to 
instruetions from attendants, to calls for medication (which was not 
swallowed), and to dining-hall instructions. Beyond such activities as 
were available to him on the admissions ward, he spent his time writing 
down his observations about the ward, its patients, and the staff. Ini- 
tially these notes were written “secretly,” but as it soon became clear 
that no one much cared, they were subsequently written on standard 
tablets of paper in such publie places as the dayroom. No secret was 
made of these activities. 

The pseudopatient, very much as a true psychiatrie patient, entered a 
hospital with no foreknowledge of when he would be discharged. Each 
was told that he would have to get out by his own devices, essentially 
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by convincing the staff that he was sane. The psychological stresses 
associated with hospitalization were considerable, and all but one of the 
pseudopatients desired to be discharged almost immediately after being 
admitted. They were, therefore, motivated not only to behave sanely 
but to be paragons of cooperation. That their behavior was in no way 
disruptive is confirmed by nursing reports, which have been obtained on 
most of the patients. These reports uniformly indicate that the patients 
were “friendly,” “cooperative,” and “exhibited no abnormal indications. 


THE NORMAL ARE NOT DETECTABLY SANE 


Despite their public “show” of sanity, the pseudopatients were never 
detected. Admitted, except in one case, with a diagnosis of schizophrenia 
(10), each was discharged with a diagnosis of schizophrenia “in remis- 
sion.” The label “in remission” should in no way be dismissed as a formal- 
ity, for at no time during any hospitalization had any question been raised 
about any pseudopatient’s simulation. Nor are there any indications in 
the hospital records that the pseudopatient’s status was suspect. Rather, 
the evidence is strong that, once labeled schizophrenic, the pseudopatient 
was stuck with that label. If the pseudopatient was to be discharged, 


he must naturally be “in remission”; but he was not sane, nor in the insti- 
tution’s view had he ever been sane 

The uniform failure to reco 
quality of the hospitals, for alt. 
among them, 


The pseudopatients were not, in fact, с 
clearly speaks more to traditi 
of opportunity. 
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more inclined to call a healthy person sick (a false positive, type 2) than 
a sick person healthy (a false negative, type 1). The reasons for this are 
not hard to find: it is clearly more dangerous to misdiagnose illness than 
health. Better to err on the side of caution, to suspect illness even among 
the healthy. 

But what holds for medicine does not hold equally well for psychiatry. 
Medical illnesses, while unfortunate, are not commonly pejorative. 
Psychiatric diagnoses, on the contrary, carry with them personal, legal, 
and social stigmas (12). It was therefore important to see whether the 
tendency toward diagnosing the sane insane could be reversed. The 
following experiment was arranged at a research and teaching hospital 
whose staff had heard these findings but doubted that such an error could 
occur in their hospital. The staff was informed that at some time during 
the following 3 months, one or more pseudopatients would attempt to 
be admitted into the psychiatrie hospital. Each staff member was asked 
to rate each patient who presented himself at admissions or on the ward 
according to the likelihood that the patient was a pseudopatient. A 10- 
point scale was used, with a 1 and 2 reflecting high confidence that the 
patient was a pseudopatient. 

Judgments were obtained on 193 patients who were admitted for psy- 
chiatric treatment. All staff who had had sustained contact with or pri- 
mary responsibility for the patient—attendants, nurses, psychiatrists, 
physicians, and psychologists—were asked to make judgments. Forty- 
one patients were alleged, with high confidence, to be pseudopatients by 
at least one member of the staff. Twenty-three were considered suspect 
by at least one psychiatrist. Nineteen were suspected by one psychiatrist 
and one other staff member. Actually, no genuine pseudopatient (at least 
from my group) presented himself during this period. 

This experiment is instructive. It indicates that the tendency to desig- 
nate sane people as insane can be reversed when the stakes (in this case, 
prestige and diagnostic acumen) are high. But what can be said of the 
19 people who were suspected of being “sane” by one psychiatrist and 
another staff member? Were these people truly “sane,” or was it rather 
the case that in the course of avoiding the type 2 error the staff tended 
to make more errors of the first sort— calling the crazy “sane”? There 
is no way of knowing. But one thing is certain: any diagnostic process 
that lends itself so readily to massive errors of this sort cannot be a very 
reliable one. 


THE STICKINESS OF PSYCHODIAGNOSTIC LABELS 


Beyond the tendency to call the healthy sick—a tendency that accounts 
better for diagnostic behavior on admission than it does for such behavior 
after a lengthy period of exposure—the data speak to the massive role 
of labeling in psychiatrie assessment. Having once been labeled schizo- 
phrenic, there is nothing the pseudopatient can do to overcome the tag. 
The tag profoundly colors others' perceptions of him and his behavior. 


и 
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From one viewpoint these data are hardly surprising, for it has long 
been known that elements are given meaning by the context in which 
they occur. Gestalt psychology made this point vigorously, and Asch 
(13) demonstrated that there are "central" personality traits (such as 
"warm" versus “cold”) which are so powerful that they markedly color 
the meaning of other information in forming an impression of a given 
personality (14). “Insane,” “schizophrenic,” “manic-depressive, and 
“crazy” are probably among the most powerful of such central traits. 
Once a person is designated abnormal, all of his other behaviors and 
characteristics are colored by that label. Indeed, that label is so power- 
ful that many of the pseudopatients’ normal behaviors were overlooked 
entirely or profoundly misinterpreted. Some examples may clarify this 
issue. 

Earlier I indicated that there were no changes in the pseudopatient's 
personal history and current status beyond those of. name, employment, 
and, where necessary, vocation. Otherwise, a veridical description of 
personal history and circumstances was offered. Those circumstances 
were not psychotic. How were they made consonant with the diagnosis 
of psychosis? Or were those diagnoses modified in such a way as to bring 


them into accord with the circumstances of the pseudopatient’s life, as 
described by him? 


As far as I can determine 
relative health of the circu 
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phrenic reaction (15). Nothing of an ambivalent nature had been de- 
scribed in relations with parents, spouse, or friends. To the extent that 
ambivalence could be inferred, it was probably not greater than is found 
in all human relationships. It is true the pseudopatient’s relationships 
with his parents changed over time, but in the ordinary context that 
would hardly be remarkable—indeed, it might very well be expected. 
Clearly, the meaning ascribed to his verbalizations (that is, ambivalence, 
affective instability) was determined by the diagnosis: schizophrenia. 
An entirely different meaning would have been ascribed if it were known 
that the man was “normal.” 

All pseudopatients took extensive notes publicly. Under ordinary 
circumstances, such behavior would have raised questions in the minds 
of observers, as, in fact, it did among patients. Indeed, it seemed so 
certain that the notes would elicit suspicion that elaborate precautions 
Were taken to remove them from the ward each day. But the precautions 
proved needless. The closest any staff member came to questioning these 
notes occurred when one pseudopatient asked his physician what kind 
of medication he was receiving and began to write down the response. 
“You needn’t write it,” he was told gently. “If you have trouble remem- 
bering, just ask me again.” 

If no questions were asked of the pseudopatients, how was their writ- 
ing interpreted? Nursing records for three patients indicate that the 
writing was seen as an aspect of their pathological behavior. “Patient 
engages in writing behavior” was the daily nursing comment on one of 
the pseudopatients who was never questioned about his writing. Given 
that the patient is in the hospital, he must be psychologically disturbed. 
And given that he is disturbed, continuous writing must be a behavioral 
manifestation of that disturbance, perhaps a subset of the compulsive 
behaviors that are sometimes correlated with schizophrenia. 

One tacit characteristic of psychiatrie diagnosis is that it locates the 
Sources of aberration within the individual and only rarely within the 
complex of stimuli that surrounds him. Consequently, behaviors that 
are stimulated by the environment are commonly misattributed to the 
patient's disorder. For example, one kindly nurse found a pseudopatient 
pacing the long hospital corridors, “Nervous, Mr. X?” she asked. “No, 
bored,” he said. 

The notes kept by pseudopatients are full of patient behaviors that 
Were misinterpreted by well-intentioned staff. Often enough, a patient 
would go “berserk” because he had, wittingly or unwittingly, been mis- 
treated by, say, an attendant. A nurse coming upon the scene would 
rarely inquire even cursorily into the environment stimuli of the patient’s 
behavior. Rather, she assumed that his upset derived from his pathology, 
not from other staff members. Occasionally, the staff might assume that 
the patient’s family (especially when they had recently visited) or other 
patients had stimulated the outburst. But never were the staff found 
to assume that one of themselves or the structure of the hospital had 
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anything to do with a patient’s behavior. One psychiatrist pointed to a 
group of patients who were sitting outside the cafeteria entrance half 
an hour before lunchtime. To a group of young residents he indicated 
that such behavior was characteristic of the oral-acquisitive nature of 
the syndrome. It seemed not to occur to him that there were very few 
things to anticipate in a psychiatric hospital besides eating. 

A psychiatrist label has a life and an influence of its own. Once the 
impression has been formed that the patient is schizophrenic, the expec- 
tation is that he will continue to be schizophrenic. When a sufficient 
amount of time has passed, during which the patient has done nothing 
bizarre, he is considered to be in remission and available for discharge 
But the label endures beyond discharge, with the unconfirmed expecta- 
tion that he will behave as a schizophrenic again. Such labels, conferred 
by mental health professionals, are as influential on the patient as they 
are on his relatives and friends, and it should not surprise anyone that 
the diagnosis acts on all of them as a self-fulfilling prophecy. Eventually, 
the patient himself accepts the diagnosis, with all of its surplus meanings 
and expectations, and behaves accordingly (15). 

The inferences to be made from these matters are quite simple. Much 
as Zigler and Phillips have demonstrated that there is enormous overlap 
in the symptoms presented by patients who have been variously diag- 


nosed (16), so there is enormous overlap in the behaviors of the sane and 
the insane. The sane are not “sane” 
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THE EXPERIENCE OF PSYCHIATRIC HOSPITALIZATION 


The term “mental illness” is of recent origin. It was coined by people 
who were humane in their inclinations and who wanted very much to 
raise the station of (and the public's sympathies toward) the psychologi- 
cally disturbed from that of witches and “crazies” to one that was akin 
to the physically ill. And they were at least partially successful, for the 
treatment of the mentally ill as improved considerably over the years. 
But while treatment has improved, it is doubtful that people really re- 
gard the mentally ill in the same way that they view the physically ill. 
A broken leg is something one recovers from, but mental illness allegedly 
endures forever (18). A broken leg does not threaten the observer, but 
а crazy schizophrenic? There is by now a host of evidence that attitudes 
toward the mentally ill are characterized by fear, hostility, aloofness, 
Suspicion, and dread (19). The mentally ill are society’s lepers. 

That such attitudes infect the general population is perhaps not sur- 
prising, only upsetting. But that they affect the professionals—atten- 
dants, nurses, physicians, psychologists, and social workers—who treat 
and deal with the mentally ill is more disconcerting, both because such 
attitudes are self-evidently pernicious and because they are unwitting. 
Most mental health professionals would insist that they are sympathetic 
toward the mentally ill, that they are neither avoidant nor hostile. But 
it is more likely that an exquisite ambivalence characterizes their rela- 
tions with psychiatric patients, such that their avowed impulses are 
only part of their entire attitude. Negative attitudes are there too and 
can easily be detected. Such attitudes should not surprise us. They are 
the natural offspring of the labels patients wear and the places in which 
they are found. 

Consider the structure of the typical psychiatric hospital. Staff and 
patients are strictly segregated. Staff have their own living space, in- 
cluding their dining facilities, bathrooms, and assembly places. The 
glassed quarters that contain the professional staff, which the pseudo- 
patients came to call “the cage,” sit out on every dayroom. The staff 
emerge primarily for caretaking purposes—to give medication, to con- 
duct a therapy or group meeting, to instruct or reprimand a patient. 
Otherwise, staff keep to themselves, almost as if the disorder that afflicts 
their charges is somehow catching. 

So much is patient-staff segregation the rule that, for four public hos- 
Pitals in which an attempt was made to measure the degree to which staff 
and patients mingle, it was necessary to use “time out of the staff cage” 
as the operational measure. While it was not the case that all time spent 
out of the cage was spent mingling with patients (attendants, for exam- 
ple, would occasionally emerge to watch television in the dayroom), it 
was the only way in which one could gather reliable data on time for 
measuring. 

The average amount of time spent by attendants outside of the cage 
was 11.3 percent (range, 3 to 52 percent). This figure does not represent 
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only time spent mingling with patients, but also includes time spent on 
such chores as folding laundry, supervising patients while they shave, 
directing Ward cleanup, and sending patients to off-ward activities. It 
was the relatively rare attendants who spent time talking with patients 
or playing games with them. It proved impossible to obtain a “percent 
mingling time” for nurses, since the amount of time they spent out of 
the cage was too brief. Rather, we counted instances of emergence from 
the cage. On the average, daytime nurses emerged from the cage 11.5 
times per shift, including instances when they left the ward entirely 
(range, 4 to 39 times). Late afternoon and night nurses were even less 
available, emerging on the average 9.4 times per shift (range, 4 to 41 
times). Data on early morning nurses, who arrived usually after midnight 
and departed at 8 a.m., are not available because patients were asleep 
during most of this period. 

Physicians, especially psychiatrists, were even less available. They 
were rarely seen on the wards. Quite commonly, they would be seen only 
when they arrived and departed, with the remaining time being spent 
in their offices or in the cage. On the average, physicians emerged on 
the ward 6.7 times per day (range, 1 to 17 times). It proved difficult to 
make an accurate estimate in this regard, since physicians often main- 
tained hours that allowed them to come and go at different times. 

The hierarchical organization of the psychiatric hospital has been com- 
mented on before (20), but the latent meaning of that kind of organiza- 
tion is worth noting again 
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to approach a particular member of the staff more than once a day, lest 
the staff member become suspicious or irritated. In examining these 
data, remember that the behavior of the pseudopatients was neither 
bizarre nor disruptive. One could indeed engage in good conversation 
with them. 

The data for these experiments are shown in Table 1, separately for 
physicians (column 1) and for nurses and attendants (column 2). Minor 
differences between these four institutions were overwhelmed by the 
degree to which staff avoided continuing contacts that patients had ini- 
tiated. By far, their most common response consisted of either a brief 
response to the question, offered while they were “on the move” and 
with head averted, or no response at all. 

The encounter frequently took the following bizarre form: (pseudo- 
patient) “Pardon me, Dr. X. Could you tell me when I am eligible for 
grounds privileges?” (physician) “Good morning, Dave. How are you 
today?” (Moves off without waiting for a response.) 

It is instructive to compare these data with data recently obtained at 
Stanford University. It has been alleged that large and eminent uni- 
versities are characterized by faculty who are so busy that they have 
no time for students. For this comparison, a young lady approached 
individual faculty members who seemed to be walking purposefully to 
some meeting or teaching engagement and asked them the following six 
questions. | 

1) “Pardon me, could you direct me to Encina Hall?” (at the medical 
school: “, . . to the Clinical Research Center?”). 

2) “Do you know where Fish Annex is?” (there is no Fish Annex at 
Stanford). 

3) “Do you teach here?” . 

4) *How does one apply for admission to the college?" (at the medical 
School: *. . . to the medical school?"). 

5) "Is it difficult to get in?" 

6) "Is there financial aid?" 

Without exception, as can be seen in Table 1 (column 3), all of the ques- 
tions were answered. No matter how rushed they were, all respondents 
not only maintained eye contact but stopped to talk. Indeed, many of 
the respondents went out of their way to direct or take the questioner 
to the office she was seeking, to try to locate “Fish Annex,” or to discuss 
with her the possibilities of being admitted to the university. | 

Similar data, also shown in Table 1 (columns 4, 5, and 6), were obtained 
in the hospital. Here too, the young lady came prepared with six ques- 
tions. After the first question, however, she remarked to 18 of her respon- 
dents (column 4). “I’m looking for a psychiatrist,” and to 15 others (col- 
umn 5), “I’m looking for an internist.” Ten other respondents received 
no inserted comment (column 6). The general degree of cooperative re- 
sponses is considerably higher for these university groups than it was for 
pseudopatients in psychiatrie hospitals. Even so, differences are ap- 
parent within the medical school setting. Once having indicated that 
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she was looking for a psychiatrist, the degree of cooperation elicited 
was less than when she sought an internist. 


POWERLESSNESS AND DEPERSONALIZATION 


Eye contact and verbal contact reflect concern and individuation; their 
absence, avoidance and depersonalization. The data I have presented 
do not do justice to the rich daily encounters that grew up around matters 
of depersonalization and avoidance. I have records of patients who were 
beaten by staff for the sin of having initiated verbal contact. During my 
own experience, for example, one patient was beaten in the presence of 
other patients for having approached an attendent and told him, “I like 
you.” Occasionally, punishment meted out to patients for misdemeanors 
seemed so excessive that it could not be justified by the most radical 
interpretations of psychiatric canon. Nevertheless, they appeared to go 
unquestioned. Tempers were often short. A patient who had not heard 
a call for medication would be roundly excoriated, and the morning atten- 
dants would often wake patients with, “Соте on, you m———f———s, 
out of bed!” 

Neither anecdotal nor “hard” data can convey the overwhelming sense 
of powerlessness which invades the individual as he is continually ex- 
posed to the depersonalization of the psychiatric hospital. It hardly mat- 
ters which psychiatric hospital—the excellent public ones and the very 
plush private hospital were better than the rural and shabby ones in this 
regard, but again, the features that psychiatric hospitals had in common 
overwhelmed by far their apparent differences. 

Powerlessness was evident everywhere. The patient is deprived of 
many of his legal rights by dint of his psychiatric commitment (21). He 
is shorn of credibility by virtue of his psychiatric label. His freedom of 
movement is restricted. He cannot initiate contact with the staff but may 
only respond to such overtures as they make. Personal privacy is mini 
mal. Patient quarters and possessions can be entered and examined by 
any staff member for whatever reason. His personal history and anguish 
is available to any staff member (often including the "grey lady" and 
"candy striper" volunteer) who chooses to read his folder, regardless of 
their therapeutic relationship to him. His personal hygiene and waste 
evaeuation are often monitored. The water closets may have no doors. 

At times, depersonalization reached such proportions that pseudo- 
patients had the sense that they were invisible, or at least unworthy of 
account. Upon being admitted, I and other pseudopatients took the initial 
physical examinations in a semipublie room, where staff members went 
about their own business as if we were not there. . 

On the ward, attendants delivered verbal and occasionally serious 
physical abuse to patients in the presence of other observing patients, 
Some of whom (the pseudopatients) were writing it all down. Abusive 
behavior, on the other hand, terminated quite abruptly when other staff 
members were known to be coming. Staff are credible witnesses. Pa- 
tients are not. 
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A nurse unbuttoned her uniform to adjust her brassiere in the presence 
of an entire ward of viewing men. One did not have the sense that she 
was being seductive. Rather, she didn’t notice us. A group of staff per- 
sons might point to a patient in the dayroom and discuss him animatedly, 
as if he were not there. 

One illuminating instance of depersonalization and invisibility occurred 
with regard to medications. All told, the pseudopatients were admin- 
istered nearly 2100 pills, including Elavil, Stelazine, Compazine, and 
Thorazine, to name but a few. (That such a variety of medications should 
have been administered to patients presenting identical symptoms is 
itself worthy of note.) Only two were swallowed. The rest were either 
pocketed or deposited in the toilet. The pseudopatients were not alone 
in this. Although I have no precise records on how many patients re- 
jected their medications, the pseudopatients frequently found the medi- 
cations of other patients in the toilet before they deposited their own. 
As long as they were cooperative, their behavior and the pseudopatients' 


own in this matter, as in other important matters, went unnoticed 
throughout, 


Reactions to such 


pital as participant observers 
"belong," they nevertheless found 
the process of depersonalization. 


en to conceal his professional 
rained for quite some time to 


epersonalization? I have already mentioned 
by all of us toward the mentally ill —inelud- 
ttitudes characterized by fear, distrust, and 
one hand and benevolent intentions on the 

n this instance as in others, to avoidance. 
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residents, and physicians combined ranged from 3.9 to 25.1 minutes, with 
an overall mean of 6.8 (six pseudopatients over a total of 129 days of 
hospitalization). Included in this average are time spent in the admissions 
interview, ward meetings in the presence of a senior staff member, group 
and individual psychotherapy contacts, case presentation conferences, 
and discharge meetings. Clearly, patients do not spend much time in 
interpersonal contact with doctoral staff. And doctoral staff serve as 
models for nurses and attendants. 

There are probably other sources. Psychiatric installations are pres- 
ently in serious financial straits. Staff shortages are pervasive; staff time 
at a premium. Something has to give, and that something is patient con- 
tact. Yet, while financial stresses are realities, too much can be made of 
them. I have the impression that the psychological forces that result in 
depersonalization are much stronger than the fiscal ones and that the 
addition of more staff would not correspondingly improve patient care 
in this regard. The incidence of staff meetings and the enormous amount 
of record-keeping on patients, for example, have not been as substan- 
tially reduced as has patient contact. Priorities exist, even during hard 
times. Patient contact is not a significant priority in the traditional psy- 
chiatric hospital, and fiscal pressures do not account for this. Avoidance 
and depersonalization may. у 

Heavy reliance upon psychotropic medication tacitly contributes to 
depersonalization by convincing staff that treatment is indeed being con- 
ducted and that further patient contact may not be necessary. Even here, 
however, caution needs to be exercised in understanding the role of psy- 
chotropic drugs. If patients were powerful rather than powerless, if they 
were viewed as interesting individuals rather than diagnostic entities, 
if they were socially significant rather than social lepers, if their anguish 
truly and wholly compelled our sympathies and concerns, would we not 
seek contact with them, despite the availability of medications? Perhaps 
for the pleasure of it all? 


THE CONSEQUENCES OF LABELING AND DEPERSONALIZATION 


Whenever the ratio of what is known to what needs to be known ap- 
proaches zero, we tend to invent “knowledge” and assume that we under- 
stand more than we actually do. We seem unable to acknowledge that we 
simply don’t know. The needs for diagnosis and remediation of behavioral 
and emotional problems are enormous. But rather than acknowledge that 
we are just embarking on understanding, we continue to label patients 
“schizophrenic,” “manic-depressive,” and “insane,” as if in those words 
we had captured the essence of understanding. The facts of the matter 
are that we have known for a long time that diagnoses are often not useful 
or reliable, but we have nevertheless continued to use them. We now 
know that we cannot distinguish insanity from sanity. It is depressing 
to consider how that information will be used. 

Not merely depressing, but frightening. How many people, one won- 
ders, are sane but not recognized as such in our psychiatric institutions? 
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have been needlessly stripped of their privileges of citizen- 
ae = right to vote and drive to that of handling their m 
counts? How many have feigned insanity in order to avoid the cr њен 
consequences of their behavior, and conversely, how many would rat E 
Stand trial than live interminably in a psychiatric Tospital = Buk a 
wrongly thought to be mentally ill? How many have been ла hag 
by well-intentioned but nevertheless erroneous diagnoses? On the = 
point, recall again that a type 2 error in psychiatric diagnosis does oe 
have the same consequences it does in medical diagnosis. A diagnosis о 
cancer that has been found to be in error is cause for celebration. But 
psychiatrie diagnoses are rarely found to be in error. The label sticks, 
a mark of inadequacy forever. 


Finally, how many patients might be “sane” outside the psychiatric 


"bor nether people? Goffman (4) 


inmates—they were, after all, not real p 
that these processes of Socialization to 


a psychiatrie hospital provide 
useful attitudes or habits of response for 


living in the “real world." 
SUMMARY AND CONCLUSIONS 

Itis clear that we cannot distin 
atric hospitals. The hospital itsel 
the meanings of behavior can easil 


onpejorative environment. Clearly, to the 

extent that we refrain from Sending the distressed to insane places, our 
i ely to be distorted. (The risk of distorted 
‚ 13 always present, since we are much more 
aviors and verbalizations than we are to 
hat often promote them. At issue here is 


ave shown, the magnitude of distortion 
is exceedingly high in the extrem i 
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Catch-22 position of psychiatrie patients. Simply reading materials in 
this area will be of help to some such workers and researchers. For others, 
directly experiencing the impact of psychiatrie hospitalization will be of 
enormous use. Clearly, further research into the social psychology of such 
total institutions will both facilitate treatment and deepen understanding. 

I and the other pseudopatients in the psychiatric setting had distinctly 
negative reactions. We do not pretend to describe the subjective experi- 
ences of true patients. Theirs may be different from ours, particularly with 
the passage of time and the necessary process of adaptation to one’s envi- 
ronment. But we can and do speak to the relatively more objective indices 
of treatment within the hospital. It could be a mistake, and a very un- 
fortunate one, to consider that what happened to us derived from malice 
or stupidity on the part of the staff. Quite the contrary, our overwhelm- 
ing impression of them was of people who really cared, who were com- 
mitted, and who were uncommonly intelligent. Where they failed, as they 
Sometimes did painfully, it would be more accurate to attribute those 
failures to the environment in which they, too, found themselves than 
to personal callousness. Their perceptions and behavior were controlled 
by the situation, rather than being motivated by a malicious disposition. 
In a more benign environment, one that was less attached to global diag- 
nosis, their behaviors and judgments might have been more benign and 


effective. 
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pseudopatient is likely to experience in the 
hospital, there are legal and social ones that, combined, require considerable attention 
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prepared for each of the entering pseudopatients and an attorney was kept “on call 


during every hospitalization. I am grateful to John Kaplan and Robert Bartels for legal 
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SCHIZOPHRENIA, LANGUAGE, AND REALITY 
Roger Brown, Harvard University 


The longest playing movie in the history of the city of Cambridge is a 
French film called the King of Hearts. I felt that I ought to see a movie 
30 successful in the city where I live, and so last December I did. Al- 
though it is about World War I, set in a small town in France that has 
been deserted by Allied troops and not yet reached by an advancing Ger- 
man army, the King of Hearts is a comedy. And a pleasant one, made 
with considerable art. But nothing in it seemed to me to account for its 
great popularity, after more than a year, with audiences largely made 
up of students. Nothing, that is, except its last scene. 

When the French town in the film is evacuated, the gateway of the 
asylum for the insane, the aliénés, is unlocked, and the inmates move 
out into the town which they now possess and where they are free to act 
out their fantasies. One thinks he is a prince of the church; another a great 
diplomat; another a general; and several old people think they are very 
small children. All dress and act accordingly and cooperate in one e 
other's delusions in such a way as to yield a bizarre but harmless an 
happy communal life. In the last scene a platoon of Allied сыр À 
to the town just as an advance platoon of Germans arrives. e x 
forces meet in the Town Square, dutifully line up in opposed ranks, rai 
their rifles on command, and shoot one another dead—without теше 
der. All this is observed by the insane, and as the film closes, they slowly 
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trail back to the asylum, and when all are inside, a hand snakes out be- 
tween the bars and locks the gate—against the world. 

In December when I saw the King of Hearts, I was reading several 
of the books of Thomas Szasz, who argues that the notion of mental ill- 
ness is a myth and that institutional psychiatry bears a close functional 
resemblance to the medieval Inquisition (Szasz, 1961, 1963, and else- 
where), being a social mechanism for disposing of people whose ideas 
constitute a threat to established powers. I had already read R. D. Laing 
(1960, and others) though with very uneven understanding, and had 
become acquainted with the notion that psychotics may have access to 
truths that are closed to the rest of us. Then I had seen an occasional 
leaflet of the Insane Liberation Movement—which is just that. АП of 
these things together suggested that the ideas about psychoses, espe- 
cially schizophrenia, which I had absorbed in graduate school some 20 
years ago and which had not seemed to be challenged in any fundamental 
way by the professional reading I had done Since, were nevertheless no 
longer considered beyond dispute. Was it the phenothiazines, I won- 
dered, that had made it possible to think so differently about schizo- 
phrenia, or were the new ideas mainly extreme manifestations of the 
surge of egalitarianism evident now in all human studies? In any case 
I felt that the introductory psychology course at Harvard had some obli- 
gation to include these ideas, and so I decided to take on schizophrenia 

even though my lack of clinical experience 
nt. Still, I thought, the class can read more 


years; : 
might n ! 


Schizophrenia, Language, and Reality 225 


ran the range of subtypes of schizophrenia. I should add that everything 
I was allowed to do had the advance approval of the patients and that 
care was taken to see that I understood the importance of medical confi- 
dentiality and would observe it. 

The experience was absurdly brief and biased in many ways. I took 
part in no therapeutic sessions. The patients I talked with were all re- 
ceiving medication, and they were just those who chose to talk with me. 
Except for the session with Schizophrenics Anonymous, the patients 
were all in expensive private hospitals widely regarded as particularly 
good ones. I do not expect that experienced clinical psychologists can 
learn anything from my observations but, on the contrary, am sure they 
can straighten me out on many things. Of course, I learned a great deal, 
but that is because I started from a base line close to zero. The experience 
in general was sad, sometimes psychologically stressful, but above all, 
fascinating to a psychologist whose experience had been limited to the 
laboratory and classroom. I presume to impose it on you only because 
there must be many psychologists like me and because the problems 
that appear to the eye of the cognitive social psychologist are somewhat 
different from those that are most salient to the psychologist who does 
therapy. | 

There are some obvious things that one learns very quickly. Twenty 
years of reading experimental studies that contrast performance on some 
laboratory task of normals and schizophrenics had, without my realizing 
it, in the case of the latter group, though not the former, built up a pre- 
conception of a homogeneous population, a monolith, bearing the mark of 
"schizophrenicity." What I discovered after a few conversations, of 
course, is that persons called schizophrenic are as diverse as persons 
called normal. But, one thinks, they must have some common quality or 
qualities, or else why are they labeled with one term, and I kept trying to 
find it as I talked with one person after another. In the first few days I 
found my mind was doing something that my reason certainly rejected. 
It was trying to find some perceptual property common to len 
ics, some special look or sound that the distance receptors, the eye an 
ear, could make use of in the future. In time, all hypotheses failed—there 
seemed to be no reliable stigma. Indeed, when that fact first came home 


to me, it was with a certain shock because one then recognizes that any- 


one, however he looks or talks in ordinary situations, may have unuttered 


thoughts that could qualify him as schizophrenic. 


DELUSIONS 


The one thing I eventually did find with the people I talked Een E 
I feel supported enormously by the fact that Norman Camer ги wi е 
great experience said the same thing, is that however norma the pn ie : 
was in talking about most subjects, eventually one came upon an area о 
delusion. What delusions are is а difficult question to which I want to 
return. Here I want only to record that when the patient hit such an area, 
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my reaction was generally quite sharp and unequivocal. In effect I said 
to myself: “I have followed you perfectly well thus far, but now you are 
saying things I cannot agree with.” For example, one young man who 
came on as very happy-go-lucky and who made one positively envy his 
sunny outlook said that when he left the hospital he would go directly 
to Scotland, where they were making the movie of Fiddler on the Roof, 
because he wanted to try out for the lead. A young girl, who had once 
been a student of mine, talked pleasantly on a number of subjects familiar 
to both of us and then began to look so frightened I asked her if anything 
were wrong. She said, “I feel this room is being bugged.” “By whom?” 
I asked. “Big Brother?” was her suggestion. An intense young married 
woman, very religious, said she let the Bible guide her life. I asked for 
an example, and she told me of the day she put her finger down at random 
оп a page, landing on the word God and noticed that God was dog spelled 
backward. For this impiety she felt the family dog must die, and she 
killed him. 

Without inquiring into the nature of delusions but simply postulating 
that they often are identified easily as such by most people in a society, 
notice that the three I have cited and indeed delusions generally do not 
seem to have the properties they should have if the King of Hearts were 


to be taken seriously. Delusions are as idiosyncratic as individual schizo- 
phrenics or normals. The 


another. And patients do 
lusions; they 


PATIENT GOVERNMENT 


I will mention one more 1 
h t 
learn what to do about it. тежи 


Patient government is something like student 
ment in prisons. It is 
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Ee wan that we never shall be really able to evaluate. Some re- 
oh utes т\н ене some are unsympathetic. Laing himself moved 
erste A out a year. Normal neighbors broke down the doors, and 
SUD smashed the windows. Some patients seemed to get better, 
ud к шеше ү walls with feces. However, the same kind of 
hes ae Eid LS elsewhere, and results may be produced that are 
Жүз cou find that the best they can manage is patient govern- 
he p p _ a point. Suppose the various privileges a patient requests 
eee d? to be granted or not, according to the vote of his fellow 
an i i saw a likable young student-age patient test this system by 
em gies every privilege in the book, including going home on week- 
ат el ai ме out that the other patients were not willing to say no to 
a 3 in the end the psychiatrist had to. The last time the young man 
A ome, he had smuggled in LSD when he returned and passed it 
kad ү, : couple of other patients, with one becoming so agitated she 
Qu. e put in packs and another attempting suicide. The limits of pa- 
government, like the limits of student government and inmate 
аа are not lost on those to whom some but not full power has 
een given. In varying degree they lend a quality of make-believe to what 
ras on, а quality that I think they find somewhat depressing. Probably, 
wever, limited power is less depressing than none at all. 


SCHIZOPHRENIC SPEECH 
Throughout my three weeks I kept my ears cocked for what has been 
called schizophrenic speech, a kind of speech still often called regressed, 


which suggests childlike, though not everyone who uses the term intends 
spoken to with what I call 


that sense. While I fairly often heard patients 

nursery school intonation, а kind of exaggerated prosody that most adults 
use with small children, I have to report that in my three weeks I never 
heard anything childlike from a patient nor indeed anything that I would 
Want to set apart as schizophrenic speech. As far as my experience goes 
and within the limits of a particular definition of speech, I would have to 
conclude that there is no such thing as schizophrenic speech. I hasten to 
add that I encountered plenty of schizophrenic thought, but that is an- 


other matter. 
Since speech or, 

making thought public, 

Schizophrenic language 


з the principal vehicle for 
hors who have written of 


have intended what I will call thought, perhaps 


favoring the term language because it is closer to the observable be- 
havioral level. I do not know how to make a deep distinction between 
language and thought, but only a preliminary working distinction. My 
Suggestion is that we confine the word language to just those systems 
that linguistie science has thus far found it possible to characterize in 
terms of general rules. These systems would be most clearly the phono- 
logical, ineluding the inventory of phonemic contrasts as well as the rules 


more generally, language i 
I know that many aut! 
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of phonotactics, describing how sounds may be combined, and the gram- 
matical, including rules of word formation and of sentence formation. The 
conquest of semantics, or meaning, is but little advanced in linguistics. 
Katz and Fodor (e.g., Katz, 1964; Katz & Fodor, 1963) in various papers 
have described a scheme for the representation of the kind of maximally 
general semantic information that gets entered in dictionaries. Infor- 
mation on this level, if it appears also in our mental dictionaries, could 
serve, as Katz and Fodor have shown, to account for a limited but impor- 
tant set of semantic skills possessed by the adult native speaker—such 
skills as the disambiguation of word meanings on the basis of sentence 
context, the identification of semantic anomalies, and of contradictions 


and synonymies. Their scheme, for instance, would mark “John aston- 
ished the rock” 


like “John hit the ball,” 
than the “social event” sense, Katz and Fod 
out that their Scheme, and it is still only a s 
plished program, has nothi 


Speech in preschool children in what 
h is the first Stage of sentence con- 
out phonology, grammar, and even 


ort and only very gradually get longer. 


2. Consonant clusters, an aspect of phonotactics, are reduced or simpli- 


fied in regular ways. 
3. Functional morphemes, like inflections for person and number, case 


endings, articles, prepositions, and so forth, are almost always omitted, 
even in contexts in which the 
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posed to be one very old lady, and we went in search of her in the old 
people’s ward, but she turned out to be home for the weekend. While 
there, we conversed with a number of other elderly patients. To be sure, 
they sometimes did not speak up quite loudly enough, but if you put your 
ear close, everything strictly linguistic was in perfect working order. 
Sometimes they were egocentric or autistic in taking for granted too 
much knowledge in the listener, and that is indeed something children 
do, but so do teachers and so at times do all of us. Maybe the pheno- 
thiazines have cured regressed schizophrenic speech, but I suspect not; 
I suspect it never existed in the sense I have described and that it was 
never childlike in its formal expressive mechanisms. I have found nothing 
in the literature from the prephenothiazine era to suggest that it was. 


REALITY TESTING 


But obviously there is something wrong with some of the things schizo- 
phrenies say and write. Not with all by any means, since schizophrenics 
are not always being schizophrenic but are only so on occasion. I have 
given three examples of assertions that I took to be deluded, and prob- 
ably most of you agreed with me. The journals and the psychiatrie wards 
echo with thousands of others, mostly more bizarre than my examples. 
But what exaetly is wrong with such a statement as, "When I get out of 
here, I'm going to fly to Scotland, where they are making the movie of 
Fiddler on the Roof, because Га really like to try out for the lead"? 
Clearly whatever is wrong is in the domain of meaning. Equally clearly, 
it is not today in the domain of linguisties; nothing general enough to go 
ina dictionary or a representation of adult semantie knowledge marks 
the quoted sentence as deviant. 

The troubles seem to begin with the knowledge of the world: (a) the 
movie Fiddler on the Roof has already been made; (b) Scotland was never 
a likely location for that movie; and (c) the patient speaking was far ^n 
young to play the lead. But ignorance of particular facts cannot be x 
Whole story. Any of us might be ignorant of one or all of them. The prob- 
lem of the delusion goes deeper than factual information. - 

Notice in the first place that the assertion must be intended wet y. 
Any Schizophrenie statement, however deluded, ceases to ms x 
phrenic if it is prefaced simply by the words “I dreamt that = | а 
ме are applying criteria we do not apply to dream reports; the ce 
one seems to be that the assertion is offered as serious enough to serve 
as a basis of action. The speaker may not act now or for years, but he 
speaks with a seriousness that implies the possibility. When the action 
implied seems imminent and would endanger the speaker or others, then, 
of course, the question of institutional commitment comes цр. | 

At this point a contrast with the fantasies or delusions of childhood is 
instructive. They can be very vivid, as we know, and in addition they may 
serve as the ground of action, but the action stays generally within play- 
ful limits. The child who says, “Look, I’m a bird,” and jumps from the 
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second step of the front porch i one thing; ve e. = says the same 

j the roof of the house is quite another. | : 
s diem ty wrong with the Fiddler on the Roof assertion as кее 
ous statement? It is not, I think, simply the ignorance it reflects ol бат 
or more of three facts, though that is part of the problem. It is rather а 
we have а sense that the true state of affairs on these three salient рош 7 
is obtained easily and disseminated quite widely in our society; most о 
us are familiar with at least one of them; many with all. It seems а а 
that a young men entertaining so large an enterprise as a trip to Scot a 
ought to have looked into the matter at least far enough to have di: 
covered that some of the propositions he is building on are false. In fact, 
we have a sense that his reality testing, to use an old phrase that still 
seems right to me, has been insufficient for the costs of the venture he 
has in mind. If it had been sufficient, he would have easily turned up 
some of the relevant truths. If we ourselves were totally ignorant of 
the truths, we would have to feel less confident in calling the assertion 
deluded, and yet, even in this case, we might judge that the probability 
of an unknown young man getting a lead in any movie is scarcely large 
enough to justify a trip to Scotland. 


This is only one example of what can lie behind the identification of a 
belief as deluded, and n 


ant upon the action contemplated. 
istorically relative notion of reality 


written that the seemingly delu 
sufficiently into the “ of the time when the imminence of 
the Kingdom of God and of the appearance of the Messiah were widely 
held beliefs, 

Reality testing of religious beliefs is not 
impossible. Only ver 


У naive prophets lik 
Riecken, & Schachter, 1956), who predict 


certain date, make 


» of course, easy and is often 
е Marian Keech (Festinger, 
ed that Chicago would be de- 
predictions vulnerable to easy 
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ie ag from her husband to help poverty-stricken strangers in remote 
ands. For people who reason morally at Kohlberg’s Stage II, in terms 
simply of satisfying one's own needs and the needs of friends and rela- 
tives, this might seem insane behavior. On the other hand, it is also 
behavior that corresponds with the highest level of moral reasoning. 
Kohlberg's Stage VI, where all human lives are held to be of equal value 
and always more valuable than property. One should perhaps require 
that psychiatrists and relatives who testify at commitment hearings be 
at least as advanced in their moral reasoning and action as the potential 
patient. Schweitzer could have made the same argument since the quite 
consistently Stage VI moral reasoning of Jesus may have seemed insane 
to certain psychiatrists mainly because they were unable to think on his 
moral level. 


EXPERIMENTAL RESEARCH ON SCHIZOPHRENIA 

In the experimental literature I have read on schizophrenic language 
and thought, there is one general finding, repeatedly confirmed for about 
15 years, that particularly fascinates me. I think William Hunt (Hunt, 
Jones, & Hunt, 1957) and Nelson Jones (1959) at Northwestern Univer- 
sity originated in the late 1950s the kind of experiment I have in mind. 
There have been later related but significant additions made by Alan 
Cohen in 1961, Brendan Maher and his associates in 1966, and probably 
quite a few others. In extracting the general result I want to concentrate 
on, I cannot describe exactly the detailed procedures and results of any 
one of the particular experiments, and I apologize for the violence I do 
them. But I think the essence of the procedure and the outcome is sup- 
ported by them all, and this seems also to be the opinion of Brendan 
Maher (Maher, McKean, & McLaughlin, 1966). In the first place, one is 
dealing in these experiments only with the linguistic products of patients 
diagnosed as schizophrenic, or in one experiment, psychotic in other 
ways. Linguistic productions of normals do not enter the picture. In the 
Second place, it is necessary to recognize, as all clinicians do, that only 
some of the linguistic productions of schizophrenics appear either disor- 
ganized or deluded; very many do not. Basically, what is done in these 
experiments is to take some short stretch of language produced by а 
Schizophrenic, bizarre or not, perhaps 50 words or 50, sometimes sponta- 
neously written or spoken, sometimes a response toa Wechsler-Bellevue 
Vocabulary item. These short stretches are put on separate slips of paper 
With no identifying information whatever, but with, 1 would suppose, 


Some automatic implication that they were seriously intended and not 
d asked to sort them, 


playful. Judges are then given the slips of paper ап ес п 
sometimes on а 7-point scale of degree of schizophrenicity, sometimes 
simply into disordered or deluded as opposed to normal. The judges 
typically have been of two sorts: PhD clinical psychologists with some 
years of experience and college undergraduates whom we unaccountably 


Still call “naive.” 
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I think there are three results of considerable interest: 

1. Most significantly, the interjudge reliability of both clinicians and 
undergraduates is consistently high, from about .70 on up, but further- 
more, the reliability of the clinicians is only slightly higher than that of 
undergraduates, always higher to be sure but not much so, and the corre- 
lations between the judgments of the two groups are consistently very 
high. 

2. Sometimes an effort has been made to study the validity of the judg- 
ments as well as the reliability, but this is not easy since the criterion 
is difficult to fix in view of the substantial unreliability of psychiatric 
diagnoses. Still, efforts have been made. One study (Cohen, 1961), for 


of adequate reality testing, 


just as some Middl i 
as ungrammatieal е English 


like grammar, is 
: sentences would 
today, so some Middle English beliefs and 
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reasoning might strike us as deluded or insane if put forward, not in a 
play, but as a basis for serious action. This does not imply, of course, that 
judgments of either ungrammaticality or delusion result from any sort 
of conspiracy to suppress dissent. Nor does it seem to me to imply that 
mental or neurological illness is a mistaken category for departures from 
what are admittedly relative standards. Especially not, since the intent 
of the category is to secure more humane treatment than is associated 
with alternative categories. The standards evolve, but a departure re- 
mains a departure even though it is not one in any absolute and eternal 
sense. 

A difference, of course, between the ungrammatical and the delusional 
cases is that, with respect to the grammars, there are reasonably full, 
explicit formulations of the rules that govern judgment. With respect 
to thought, explicit formulation is very spotty and perhaps can never 
approach completeness. The most relevant formulations that exist are 
not, I think, the logic and the principles of evidence taught in schools, 
though these are canons to which we make ultimate appeal. But I should 
think the most relevant efforts to make explicit the notions of rationality 
on which we operate are the efforts to describe psychological implica- 
tion, principles of affective consistency, principles of attribution, and 
the common fallacies of logic and of evidence evaluation. All of these 
seem to be efforts to describe rules we really use rather than rules we 


ought to use, and it seems chiefly to be the rules we use that cause propo- 
sitions to be considered deluded. It is perhaps also the departures from 


these rules that indicate mental illness. Obviously only a small beginning 
has been made in the task of making explicit the rules and knowledge by 
which we judge rationality, but that does not prevent us from making 
the judgments and making them with a considerable degree of agreement. 

Every sort of disease represents a deviation from a norm; in somatic 
diseases it is a deviation from the norms of the organs and functions of 


the body— norms that are often uniform for the species. In mental di- 
belief, perception, and 


Sease it is a deviation from the norms of action, perce ‹ 
во on—norms that are, at least in part, culturally and historically vari- 
able. There have been a great many attempts to describe the nature of 
the basic deviation in schizophrenia: conereteness or overinclusion or 


privateness in concept formation; overgeneralization in learning; para- 
logie in thinking; associative intrusions in language; a failure to filter 
what is irrelevant in attention, etc. In general, that part of the vast quan- 
tity of research supporting these ideas that I have been able to read 
Seems to me rather disappointing (with the qualification that I think a 
better ease can be made for attention than for the others) The results 
Seem to me generally disappointing in that they tend often to be incon- 
Sistent across studies and, even when consistent, usually yield small, 
barely significant differences between schizophrenies and normals. Of 
course, there are any number of possible reasons why the experimental 
outcomes should be so confused: failure to specify schizophrenic sub- 
types, failure to say what if any medication was in use, unreliability of 
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diagnosis, and so on. But after reading quite a lot of this work, I cannot 
help wondering if there is not a more fundamental difficulty. The tasks 
used have typically been well-established laboratory tasks having slight 
and very occasional personal interest to any subject. It is as if the as- 
sumption were that schizophrenia resulted from an across-the-board, 
content-free impairment of а Базе function like perception, learning, 
concept formation, or attention. If that were indeed the case, then it 
would make no difference what task was used to test the function, and 
convenience might as well dictate the selection. But is this the case? 
It may be that we have here been too much influenced by the disease 
concept. Lacking psychic organs comparable to the heart, liver, and 
Spleen, we may have made too much of the unity of our functions. In my 
conversations with schizophrenics, it was not my impression that some 


particular function had become impaired in a content-free way. Quite the 


contrary, it seemed that when one hit upon or even approached the dis- 
turbed content area, any or all 


functions might suffer disruption. And 
the differences from normals were not slight. The average laboratory 
task, possibly any laboratory task, may elicit a lower performance from 
| schizophrenics than from normals because it touches here and there on 
something that matters to one or another subject, and this will happen 
a bit € often and will be more disruptive for schizophrenics than for 
normals. 


What should cause some peo 
of reality testing and rationali: 


balance, incongruity, i i 
mer dence i | s 1 attitude change. No one has 


-bind message of your own, for instance, 


ailable for communication. Peo- 
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ple do it all the time. Or leave the field. Or disre, 
і time. . Or rd 

me m m role of the double bind at all Plausible a 
Shee nee , one has to explain why some people cannot react in any of 
ues А — н s but must feel both sides of the bind intensely, be 
reels i Lin kind, or leave the field. If these are the problems 
ue T de = i 18 not the existence of the bind as such that causes ik 
= Ў uld per haps add that by my view of the mind-body problem 

oes not deny biochemical causes m espousing family interaction 


cau ү i i 
ses. Of course, there are biochemical causes, and it would not surprise 


me at all if the biochemists got there first with a precise etiology and an 


effective therapy. 
All of this logic chopping leaves me with the feeling that I have been 


e to do nothing in return for the kindness of the persons called schizo- 
Nee ~ whom I met during my three weeks outside the office. And I 
€ y regret that, because I found myself sympathetic to all of them 

pained by the suffering many endure each day. I cannot forget the 
"mem remark made by the young patient in the novel J Never Prom- 

еа Tou a Rose Garden, à remark made to her intellectualizing psychi- 
atrist: “Doctor, my difference is not my disease." Cognitive psychologists 
cannot help being interested also in the differences and must hope pa- 


tients will forgive them. 
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8. PSYCHOSURGERY: 


A Multilevel Controversy 


In 1936 under the direction of the Portuguese physician Egas Moniz, 
the first recorded psychosurgical operation on a human being was per- 
formed by Almeida Lima. This operation, termed a prefrontal leukotomy, 
involved the removal of several cores of brain tissue from the frontal 
lobe of an individual classified as schizophrenic. Although it is somewhat 
uncertain just what the result of this particular operation was (the pa- 
tient survived), Walter Freeman and J. W. Watts introduced a modified 
frontal lobotomy technique into the United States the following year. 
It is estimated that during the period between 1936 and 1955 as many as 
50,000 such operations were performed in this country for various rea- 
sons, primarily in efforts to alleviate the worst symptoms of agitated 
schizophrenia. However, many other conditions were also treated, in- 
cluding depression, homosexuality, and narcotic addiction. Psychosur- 


gery is brain surgery performed with the intention of altering behavior. 
те was no indication of any disease 


In the great majority of these cases the 
or damage to the brain. Brain surgery to remove diseased or damaged 
tissue is not properly termed psychosurgery. р 

Naturally, such radical procedures received considerable attention, 
but from the beginning it has been extremely difficult to obtain objective, 
reliable information as to the outcome of such surgical procedures. Too 
often, the only information available is from the neurosurgeon who per- 
formed the operation. Can you anticipate the problems of getting reliable, 
objective information in such cases? In many other cases preoperative 
data is sketchy or unavailable, and postoperative reports are simply that 
the patient is improved, about the same, or worse. 

The authors of the following two articles address themselves to some 
of the issues and controversies in this area. As you will see, they deal 
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8 5 
with a wide range of issues: medical, legal, and practical €: em 
how to get adequate data in order to evaluate the еп aa aC go 
surgical procedures. Although the number of such a mene d 
creased in the United States since the 1935-55 period, t P fue gus 
Several neurosurgeons performing such operations, perhaps an о in 
dred a year in this country. Worldwide, psychosurgery к, ойн 4, 
many other countries, such as West Germany, Denmar › Sorter 
Canada, India, Switzerland, Spain, France, Mexico, Japan, 

Finland, Italy, and Norway. _ 

In this te the state laws about who can perform ка, 
surgical operations for what reasons and on whom vary ene nig 
A class project may be based on these questions: What are t ^ senes di 
lating psychosurgery in your рав mu is the current freq 

Surgery? How could you find out? : > 

kc Brown, Louis A. Wienckowski, and Lyle W. Bivens ied 
the current status of psychosurgery from a somewhat up boe 
point. Writing for the Department of Health, Education, and We. — 5 
they point out the recent history, current controls, and necessary РА 
guards for the future for patients recommended for etes кы, ^ 
you read this article, see if you ean define the major problems => та 
trolling psychosurgery. Also, try to define the issues that go beyon 
borders of academic psychology. 

The article by Vernon Mark is 0! 
psychosurgeon fo 


Possibility for further reading is E. Valen- 
stein’s book, Brain Control (1978). 


One of the nagging problems related to psychosurgery (and other pro- 
i rmed consent.” If the informed consent of the patient is 
required for such Surgery, as many have proposed, what are some of the 
difficulties that may arise? 


Wh 


: те informed 
at are some possible problems if informe 
consent is not required? 


One state that has wrestled with this problem is Oregon. Oregon has 
a quite detailed and elaborate law regulating psychosurgery. Oregon 
Senate Bill 298, which took effect on July 1, 1973, may serve as a mode 
for other state legislatures, 


In addition to the legal and ethical issues, there are other points of 
controversy. The si 


ght be the role of a psychologist 
ersial issues? 
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PSYCHOSURGERY: PERSPECTIVE ON 
A CURRENT ISSUE 
Bertram S. Brown, Director, 
National Institute of Mental Health 


Louis A. Wienckowski, Director, 
Division of Extramural Research Programs 


Lyle W. Bivens, Chief, Neuropsychology Section, 
Division of Extramural Research Programs 


g up their minds. They would rather 


tion than a dozen facts. 
vitt, Voyages and Discoveries, 1939 


People don't ask for facts in makin 
have one good, soul-satisfying emo 
—Robert Keith Lea 

The control of human behavior has become an issue of considerable 
concern and prominence. Most of the concern is generated by a rapidly 


advancing biomedical and behavioral science technology that provides 
behavior and thus carries 


explicit and powerful means for controlling 

with it a correspondingly increased potential for abuse. Much recent 
publicity has focused on the control of behavior by removing or destroy- 
ing portions of the human brain, a procedure usually called psychosurgery 
(Breggin, 1972a, 1972b; Holden, 1973). It also is called psychiatric sur- 
gery or functional neurosurgery. Psychosurgery is not a new technology. 
It has been used for nearly forty years to treat a variety of mental ill- 
nesses. But the dramatic and drastic nature of psychosurgery has aroused 
many concerns about its potential abuse. The following discussion 15 
intended to provide an objective account of some of these concerns as 


Well as to offset some of the misconceptions that have been fostered by 


recent and strongly critical publicity. 


WHAT IS PSYCHOSURGERY? 


be defined as surgical removal or destruction of 


Psychosurgery can 
brain tissue or the cutting of brain tissue to disconnect one part of the 
Louis A., and Bivens, Lyle W. Psychosurgery: Per- 


Brown, Bertram S., Wienckowski, 
Spective on a current issue. National Institute of Mental Health, Alcohol, Drug Abuse, 
апа Mental Health Administration, U.S. Department of Health, Education, and Welfare. 


Reprinted with permission. 
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- ue itis per- 
brain from another with the intent of altering pm чаан E 
formed in the absence of direct evidence of existing stru 
e in the brain. on айй 
~ surgical treatment of epilepsy, while їп опе wer s — 4 
surgery since behavioral symptoms аге altered, shou e mud there № 
this discussion when the disease can be clearly diagnose chology in the 
convincing evidence that epilepsy is caused by organic ра ir ae aiae 
brain. Of course, any other neurosurgical treatment us ep ===> 
damaged brain tissue ог to remove tumors is not psye = p ee 
In neuroanatomical terminology, the suffix -otomy re er Lobotoni, 
Struction of brain tissue or the cutting of fibers in the br Tes bndaf 
therefore, is the destruction of tissue in the frontal lobes | t ake 
the cutting of fibers connecting the frontal lobes with е habes 
brain. Thalamotomy refers to destruction of portions о LE. с Гоша 
The suffix -есіоту refers to the actual removal of tissue. “ne Topec 
lobectomy would be the removal of the frontal lobes of the "s dium 
tomy refers to selective removal of portions of the cortex, us 
the frontal region of the brain. | 
Ever since the first “radical” lobotomies were performed Mp eed e 
in which virtually all of the subcortical connections of the fron и 
were severed, surgeons have attempted to refine the techniques Tope 
the brain destruction in the hope of reducing adverse side — di d 
tomy, which damaged less brain tissue than the early lobotomi cbe: 
tried. Then a further refinement of topectomy was the developm 


: ions of a 
the undercutting techniques whereby the subcortical connection 
selective part of the corte 


i left 
х were severed but the cortex itself was 
intact, _ 
А CN sum 
This search for limitation of destruction of brain tissue (and pre 
ably of ad 


verse side effects) culm 
а small electrode is positioned i 
X-ray inspection are used to pla 
then the tissue is destroyed a 
through the electrode. With th 
structures deep within the ај senlace 
amygdalotomy, thalamotomy, and hypothalamotomy began to rep 
lobotomy as psychosurgical procedures, А hizo- 
Psychosurgery Was recommended for curing or ameliorating sc lit 

phrenia “Lobotomy,” 1969), depression (Knight, 1969), м 
“Homosexuality,” 1970), childhood behavior disorders (Andy, 19 68: 
criminal behavior (Vaernet & Madsen, 1970; Koskoff & Goldhurst, 19 ҮЙ, 

d a variety of narcotic addiction (Smolik, 1950); Sea: ж 

gh, & Angelucci, 1952), and other psy¢ 

atric problems. For а period 


inated in stereotaxic techniques in ни 
n the brain. Geometric coordinates an d 
ce the electrode in a precise location, + 
t the electrode tip by passing a eciam 
e development of stereotaxic techniqu a 
in become accessible for destruction, а! 
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а to manage and totally disabled in terms of functioning in 
я de а = ge number of psychosurgical procedures were performed 
ae a (perhaps as many as 50,000 between 1936 and the 
iin 2 hese operations were especially popular in England, and 
а as 1962 an English study estimated that between 400 and 525 
perations a year were being performed there (Pippard, 1962). 


EVALUATIONS OF PSYCHOSURGERY 
aoe with the first lobotomy by Egas Moniz in Portugal in 1935, 
е was consider able criticism of the technique and of the exaggerated 
-— by its proponents of the curative power of lobotomy. This criticism 
ad little effect on the psychosurgeons, who continued to ignore or gloss 
over the side effects of their procedures and who continued to accept the 
view that “it is better for the patient to have a simplified intellect capable 
of elementary acts, than an intellect where there reigns the disorder of 
Dum synthesis. Society can accommodate itself to the most humble 
T but it justifiably distrusts the mad thinker" (Freeman & Watts, 

In 1947 an exhaustive study of 48 mental patients, 24 of whom had 
undergone some form of psychosurgery (lobotomy or topectomy) was 
undertaken by a group of scientists called the Columbia-Greystone Asso- 
ciates. An extensive series of medical, psychological, and physiological 
tests was conducted on these patients, and the results reported in 1949. 
Ina follow-up report 3% years after the original study, the status of the 
original patients in the study was reviewed. Neither the original nor the 
follow-up study was able to provide any substantial scientific evidence 
that these psychosurgical procedures were useful (Mettler, 1952). 

A much larger project in England evaluated 10,365 patients subjected 
to lobotomy (Tooth & Newton, 1961). The study revealed that about half 
of the operated patients remained in the hospital at the time of the survey 

ovided, and since 


in 1957. Little or no data on psychological changes are pr 
f psychiatric disorders, one 


the operated patients spanned a wide range о 

could speculate that the patients that were discharged might have been 
discharged even if they had not undergone surgery, since many psy- 
chiatric problems show а tendency to disappear without (or despite) 


treatment. 

Many smaller studies, using small n 
psychiatric or behavioral disorders, have been reported in the literature. 
Surgical techniques range from lobotomies to the implantation of radio- 
active pellets to destroy deep brain tissue. Although many of the pub- 
lished studies are deficient in methodology and clinical reporting, there 
are several that conclude that psychosurgery can be useful in selected 
cases as a “last resort treatment” (Koskoff, Dennis, Lazone, & Wheeler, 
1947; Miller, 1967; Strom-Olsen & Carlisle, 1971; Tan, Marks, and Marset, 
1971; “Surgical Modification,” 1971; Kalinowsky, Brill, Freyhan, Cattell, 
& Ransohoff, 1969; Scoville, 1969; Shobe & Gildea, 1968). One study goes 


umbers of patients with varied 
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ice” in 

so far as to suggest that leucotomy may be the “treatment of px 
certain cases, rather than being used as a last resort (Hohne 

0). . 
p m it is easy from our current vantage point to а њиме = 
“lobotomists,” it should be pointed out that they were = en 
genuinely concerned professionals who were trying to x - cn 
suffering from a devastatingly serious psychiatrie disease. rimis 
that it was ethical and reasonable to change an immobile, ШШ оре” 
tive, or destructively hyperactive patient into a placid “house od 
by psychosurgery is not entirely without merit. The most extreme Might 
toms of mental disease which they were faced with on a daily basis n 
have made the most ethically concerned and moralistic of us reso 
desperate measures. 

Th fact, lobotomy had a clear, albeit limited, usefulness. Lobotomy bo 
а “therapy of desperation,” but it frequently improved the гаа 
patients suffering from severe mental illness, either by oe pe тей 
condition within the hospital or by permitting them to lead usefu ES 
outside the hospital. The popular conception that lobotomy turns a per S» 
into a "vegetable" is not true. Many of the psychological wee == а 
duced by lobotomy are often much more subtle than is popularly be 05 ^" 
(Smith, 1960; Poppen, Pribram, & Robinson, 1965; Smith & Kinder , Бе 
Hamlin, 1970). Faced with the choice of leaving a patient raving in ме 
back ward of a mental hospital, soiling himself with his own feces, a а 
attacking other patients and staff or performing a lobotomy with a 6» s 
sonably good chance of improving his condition so he can be moved rhs 
ward with better conditions, many reasonable people would make я 
latter choice. Although in many cases lobotomized patients who wer 
able to be discharged from the hospital functioned at a lower neat тим 
level than prior to the onset of their mental illness, this was better as 
the realistic and grim prospect of spending the rest of their lives in 
hospital. 


, had an abrupt onset of delusions zd 

ed by enemy agents and had seen and spoken w1 

Dr. Goebbels, Winston Churchill, and the Archbishop of Canterbury- 
i i she was withdrawn or 


In the many case histories Where lobotomy has had a beneficial effect, 
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particularly for schizophrenic patients, this seems to be the typical pat- 
tern—improvement in personal habits, reduction of hallucinatory or 
delusional behavior, and improvement in social behavior. But because 
thought disorders and disorientation persist, the patient cannot really 
effectively function in the outside world and remains in the hospital envi- 
ronment. For other types of mental illness such as obsessive-compulsive 
neurosis, lobotomy more often relieved the symptoms to a sufficient 
degree that the person could be discharged from the hospital and return 
to productive work. 

However, there were occasional disastrous effects of lobotomy (or of 
other forms of psychosurgery) on personality or intellectual function, 
and these cases were the source of much of the strong criticism leveied 
* lobotomy. An illustrative case is one from a series of medical temporal 
obectomies performed on a group of schizophrenie or epileptie patients 
that resulted in devastating memory deficits. The case of H. M., who 
was operated on for relief of epilepsy, is described as follows: 


The patient was drowsy for the first few postoperative days but then, as he became 
more alert, a severe memory impairment was apparent. He could no longer recog- 
nize the hospital staff, apart from Dr. Scoville himself, whom he had known for 
many years; he did not remember and could not relearn the way to the bathroom, 
and he seemed to retain nothing of the day-to-day happenings in the hospital. At 
the same time, it was discovered that he had an extensive though patchy, retrograde 
amnesia, such that he did not remember the death of a favourite uncle three years 
before, nor anything of the period spent in the hospital three years before the opera- 
tion, but did recall a few trivial incidents that had taken place just before his ad- 
mission to the hospital. His early memories were seemingly vivid and intact, his 
speech was normal, and his social behaviour and emotional responses were entirely 
appropriate. 

Some 12 years later, 
... His mother states now, as she did in 195: 
Objects constantly in use are kept; for example, although he mows the lawn regu- 
larly, and quite expertly, she still has to tell him where to find the lawnmower, even 


when he has been using it only the day before. His mother also observes that he will 
tice effect, and 


do the same jigsaw puzzles day after day 

read the same magazines over and over а, i ing their contents 
familiar. The same forgetfulness applies to рео! 
even to those neighbours who have been visiting the house regularly for the past 
Six years. He has not learned their names and he does not 
he meets them in the street. (Milner, 1966). 


the patient's condition was unchanged: 
5, that he is unable to learn where 


Similar, but somewhat less severe memory defects were observed in 


other patients in the series. 

But it was not the evaluations of psychosurgery or the ethical issues 
that led to its decreased use. The event that had the greatest impact on 
reducing the number of psychosurgical procedures was the introduction 
of chemical therapy for mental illness. With the advent of the tranquil- 
izers in the mid-1950's, psychosurgery lost its position as the major 
Somatic treatment for severe mental illness. Psychotic behavior that 
had previously been controllable only by psychosurgical treatment could 


SY 
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i mical 
now be controlled by less drastic, and usually more effective, che 
therapy. 


CURRENT PSYCHOSURGERY 


erer uch 
It is difficult to estimate the current extent of patios В 
of the psychosurgery today is performed by private praet ae, does 
there is no way to obtain any reasonably accurate figures. noaaraa 
seem to have been some increase in the frequency of psyc f pharma- 
operations over the past 15 years, probably due to the о а реј а nid 
cotherapy to fulfill the earlier expectations for it. Current y ЕА limbic 
geons appear to be concentrating on destruction of portions of yedik 
system of the brain, which animal research has shown to bs. m ової 
emotional behavior. Interruption of the cingulum bundle or aout to 
the cingulate gyrus—parts of the cortex which early workers t а to 
be a brain locus responsible for the experiencing of v pe i b 
be a particularly popular procedure (Ballantine, Cassidy, F tan E the 
Morino, 1967; Brown & Lighthill, 1968). Destruction of lees таа 
thalamus (Mark, Barry, MeLardy, & Ervin, 1970) or hypothe ref iil 
(Schvarez, Driollet, Rios, & Betti, 1972), other structures implica 
emotional behavior, has been used. . Yogress 
Over the past two decades, scientists have made considerable p gom 
in defining and understanding the functional role of the fronta 1972; 
and of various other parts of the brain (Isaacson, 1972; Thorne, 


Roos, 
Rosen, Stein & Butters, 1971; Pribram, Ahumada, Haryog, & 


ucted 
1964). Almost all of the basic research in this area has been cond 
with animal subjects. For ex 


. al 

ample, a scientist supported by the ered 

Institute of Mental Health has localized areas in the cat brain, vedi 

When electrically Stimulated with permanently implanted elec nother 

to mount a directed and well-integrated attack on an cat 

f such electrical stimulation, 1e can 

у, stimulation of other brain ar ш be- 
g an aggressive attack to stop an 
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see if whatever behavior they are trying to eliminate can be elicited 
by the stimulation. In other cases, stimulation of only those brain areas 
thought to inhibit aggression is performed; any procedure based on stim- 
ulation is preferable, because electrical stimulation generally produces 
less brain damage than lesions. In any case, whether lesion or stimula- 
tion techniques are used, the modern psychosurgeon attempts to mini- 
mize the adverse side effects of the procedures by lesioning or stimulating 
only a small area of brain tissue, with the hope of eliminating the specific 
behavioral problems but not producing any undesirable changes in per- 
sonality, intelligence, creativity, or motivation. 

An illustrative case history is provided in a book entitled Violence and 
the Brain by Vernon H. Mark, M.D., and Frank R. Ervin, M.D. (1970). 
They cite the case of Julia, a 21-year-old girl who had on several occasions 
assaulted persons for no apparent reason. She was also subject to fre- 
quent severe temper tantrums and psychotic episodes, and she made 
several unsuccessful attempts at suicide. Julia’s troubles can apparently 
be traced back to a severe attack of encephalitis following mumps before 
She was two years old. Since the age of 10, she had been having epileptic 
Seizures. After several serious assaults, including stabbing another girl 
in the ladies lounge of a movie theater and stabbing a nurse ш а mental 
hospital with a pair of scissors, and after numerous unsuccessful attempts 
to treat Julia with psychotherapy, electroshock, and a variety of drugs, 
she was admitted to Boston City Hospital for clinical evaluation. Elec- 


trodes were implanted in the temporal lobes of her brain, from whith 
Seizure discharges were recorded. By electrically stimulating the brain 
re able to elicit symptoms that were 


areas the attending physicians we І г 
characteristic of кка А of her seizures. A lesion was made in 
the left temporal lobe, which seemed to be the side from which пеш 
toms could be most easily produced. However, this lesion did no =: 4 
to improve her condition. Using radio-stimulation techniques, in w E 
an implanted electrode in the brain can be stimulated without aei rd 
wires or leads connected to the patient, the physicians i al pis 
right temporal lobe and elicited fullblown rage reactions d eue 
during which she would attack the wall of her room or attack е m 
person if anyone was present. Abnormal electrical M as eni E 
corded just prior to the rage attacks. Another lesion was t pee we 
the right side and in the two years since the second wey : ее 
Julia has had only two mild episodes of rage. ali е ер 
seizures and her psychotic behavior have been unchange - "— 
Most of the above discussion has focused on aggress E dun 
rather than on the many other forms of mental Taa н dapes ri 
hreni ession. Alth а 
кк of н тм or other serious mental problems, currently 
it is principally the uncontrollably aggressive patien dom bu qe 
prime candidate for psyc с 5 | 
most disruptive and euentum friends, and hospital staff. 


Y 
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е 4 e peo- 
The serious disturbance of the patient’s own life and ei = о A 
ple around him and the potential for bodily harm that i: especially 
himself or others often leads to treatment of a с ON " 

when other treatments, partieularly drugs, seem to fail. 


PROGRAMS OF THE NATIONAL INSTITUTE OF MENTAL HEALTH 


rongly 
Since 1947 the National Institute of Mental Health hag y bom E а 
committed to a broadly based research program on br nS oe 
behavior. All of the mental states of man are related to р dem hate at 
occurring within the brain. Emotions such as love, s А setivity i 
generally believed to be reflections of chemical and electr ves System: 
the billions of nerve cells that comprise the central я паз 
When man or animal eats, drinks, or engages in sexual ac sae peat 
drive states are a function of activity of the brain. The Ен mellis 
complexity of the brain, not just of man but of кже ашпа and: реке 
a distinct challenge both to Базе DE and to scientists 
tioners devoted to the Study of mental illness. . nts 
If normal behaviors are produced and controlled by physica ae 
in the central nervous system, then pathological behaviors са In 
result of these physical processes Operating in pathologica ential to 
order to understand the root causes of mental disease, it is pe bae 
understand the physiologieal events that Shape and control = is afi 
haviors. Consequently, Support of basie brain-behavior resear 


: : Health. 
integral part of the activities of the National Institute of Mental 
One of the first 


gram of the NIM 
in the learning process. Since that 
of the Institute h 
support for neuropsycholo: 
intramural research Г 
research on the physi 
chemical systems of the brain, 

Despite the great 
tionships, a great de. 


i ; in-behavior 
important problems regarding brain aes 
eing investigated in research laboratories 
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the country. For example: What chemicals are responsible for trans- 
mission of nerve impulses within the brain, and how do these chemicals 
control different forms of behavior? What chemical or physical changes 
occur when learning takes place or when a memory is recovered from 
the brain? What anatomical structures and what chemicals are respon- 
sible for aggressive behavior, sexual behavior, eating behavior, drinking 
behavior, and for sleep and dreaming? How are the brain and central 
nervous system related to the autonomic nervous system, which controls 
many life processes such as heart rate, blood pressure, and a wide variety 
of visceral functions. What happens to the behavior and cognitive pro- 
cesses of man when portions of the brain are damaged through accident, 
birth injury, malnutrition, or disease? Can functions that have been lost 
through such disruption of the brain be recovered or taken over by other 


portions of the brain? 
These and many other lines о 


grantees of the National Institute о: 
intramural research program. However, as each specific question is 


answered to some degree, several other questions arise as a result of 
the research, As more and more is learned about control of behavior by 
the brain, questions asked by brain scientists become more specific and 
more amenable to experimental tests. We are thus well on the way to 
building a solid framework for understanding mental processes. 
Although most of the brain-behavior research supported by the In- 
stitute involves experiments with animal subjects, some studies involve 
human subjects. Most of the human studies are aimed at understanding 
the behavioral consequences of altered brain function as a result of war 
injury, accident, disease, or surgery performed because of clinical neces- 
sity (e.g., removal of brain tumors). Thus neurological patients are stud- 
ied with behavioral tests in order to define and understand the sometimes 


subtle effects of insult to the brain (Sperry, 1973; Teuber, 1968). In these 
cases there is no experimental manipulation of the brain itself; the sub- 
jects are selected on the basis of pre-existing brain dysfunction, damage, 


or clinical surgery. 


THE NIMH STUDY ON EPILEPSY AND VIOLENCE 


The single project supported by the NIMH which came closest to 
psychosurgery was a contract, in the amount of $500,000 to the Neuro- 
Research Foundation in Boston, Massachusetts, which began on June 25, 
1971, and terminated on March 31, 1973. The contract involved an at- 
tempt to understand and control violent and destructive behavior that 
is presumed to be a direct result of brain dysfunction. 

The research being conducted under this contract was based on an 
hypothesis that some forms of mental illness, especially those charac- 
terized by violence or hyperactivity, are the direct result of organic brain 
disease. Based on evidence from animal studies showing relationships 
between the limbic system and emotional behavior, the limbic system of 


f research are being actively pursued by 
f Mental Health and in the NIMH 
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the brain is usually singled out as the area most likely to be involved 
(Wasman & Flynn, 1966; Ellison & Flynn, 1968). There are also some 
scattered and anecdotal reports in the clinical literature on the effects 
of damage to the limbic system by tumors or by war injury or accident 
in which a predominant Symptom is violent behavior. A few studies have 
purported to show that there is a high incidence of abnormalities in the 
electrical brain activity (EEG) of criminal populations, particularly їп 
those subjects who have repeatedly committed senselessly violent acts 
(Mark & Ervin, 1970). Unfortunately, interpretation of the EEG is a 
difficult task, and the meaning of any given type of “abnormality” is an 
Open question in view of the present inadequate understanding of the 
electrical processes of the brain. In any case the main point is that the 


psychosurgeon believes that there is an organic basis for disordered 
behavior. 


Certain forms of epilepsy, 
trical seizure activity in the 
geons as a model for explain 
violent behavior. The suppo: 
takes place in a part of the 
behavior, In fact, there ha 
tional and behavioral state 


disturbances), 


The group at the Neuro-Research Foundation hypothesized that for 


some of those persons who exhibit periodic uncontrollable violent be- 
havior, a process akin to psych 


source of such brain dysfunctio: 
system, and thus the primary 
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automatice and mor -di А 3 
sina, he m m disturbed behaviors seen in psychomotor 
function which th 5 group was to try to identify the organic dys- 
The main t „> p Ше presumed is responsible for violent behavior 
fis ртАш e е used is to implant electrodes in various parts of 
dibus dide: to identify abnormalities, particularly seizure-like 
Этна ee үс gee noe behavior. If an apparent 
: ч ound, an this seizure activity appear: 
ү үү шш of aggressive behavior, then а oral lesion + 
feces wd ot es : 44 tissue at the seizure focus. Ideally, this restricted 
А, i, e т activity at its source. Whether or not this 
ning of this pa = is de e ro mien № defined in the begin- 
m dens ie pe аш. though а surgery is done to treat 
of the brain: , р у intended to stop the seizure activity 
M ae се » this seemingly straightforward and rela- 
а ede о а аел ES Eu e first place there remains some question 
even correlated with a eun as the brain is directly responsible for or 
ә п aggressive behavior. There are no firm scientific 
data, either from animal or human studies, to demonstrate а close rela- 
tionship between either focal or generalized seizures and aggression. 
Therefore, one of the aims of the project being conducted by the Neuro- 
Research Foundation was to examine the degree of correlation between 
electrical seizures in the brain and aggressive or violent behavior. 

One factor that makes this kind of research especially difficult is that 
simply admitting an epileptic patient or a person exhibiting violent be- 
havior to a hospital and placing him in a treatment environment may 
greatly reduce the frequency and/or severity of the symptoms, in the 
absence of any medical treatment or drug regimen. Thus it is difficult to 
study a phenomenon that often disappears or appears much less frequently 
than before as soon as the patient is put in a treatment environment. 

Factors such as this led the research team of the Neuro-Research 
Foundation to focus more on defining the environmental situations that 
trigger seizure activity or aggression and to test the value of psychiatric 
treatment, such as group therapy, in helping patients to deal with and 
control their periodic outbursts of violent behavior. In fact, during the 
course of this contract no psychosurgery was carried out (although one 
chronically epileptic patient did receive a brain lesion). The research 
emphasis was on electrical recording studies, correlation of electric brain 
activity with various behaviors, and psychiatric treatment. | 

One of the primary issues in this area, a$ in all research projects deal- 
ing with human subjects, is the protection of the rights of the individual. 


A project which involves medical or surgical treatment in combination 
with psychiatric treatment and deals with patients who are severely 
debilitated in terms of their capacity to function in society presents es- 
pecially thorny issues of ing the physical and psychological well- 
being of the patient. An equally important ethical issue i$ the scientific 

ince if the studies are not scientifically 


merit of the proposed research, sin 
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sound and are not likely to yield important and useful data, then the 
human subjects have been exposed to medical or psychological risks that 
are not outweighed by the potential benefits. To illustrate how important 
these issues are, we shall outline in some detail the scientific and ethical 
review process involved in establishing the contract with the Neuro- 
Research Foundation and describe the continuous monitoring of the 
project. . 

At the FY 1970 HEW budget hearings, Dr. William H. Sweet, Chief 
of Neurosurgery, Massachusetts General Hospital; Dr. Vernon H. Mark, 
Director of Neurosurgery at the Boston City Hospital; and Dr. Frank R. 
Ervin, Department of Psychiatry, Harvard Medical School, appeared 
before the Congressional Appropriations Committee and requested 
funding for partial support of a facility designed to study violence in 
relation to temporal lobe epilepsy. The Study involved the control of 
rage seizure through the implantation of electrodes into the brain and 
gradual destruction of tissue in the amygdala and other Structures in the 
temporal lobe. Other projects planned for the program were studies of 
chromosomal and dermatoglyphie (fingerprint) abnormalities as possibly 
related to epilepsy. The Congress earmarked $500,000 from Chronic 
Disease Program funds for this study. 

As a result of expenditure controls in FY 1970 the Neurological and 
Sensory Disease Program in the Chronic Disease Program, which was 
relevant to such a special project, was eliminated. Consequently, the 
funds were not awarded in the first year. However, the FY 1971 budget 


for the National Institute of Mental Health contained $500,000 
marked for funding this project. The 


the Congressional Report makin 
of Labor and Health, Education, 


ear- 
following paragraph is quoted from 
appropriations for the Departments 
and Welfare for FY 1971: 

Last year the Congress determined that a 
violent behavior resulting in death or crit 
of discovering medical means for preve 
violence. The Congress directed the Chronic Di 


t priation be devoted to the first 
year operations of a study of violent behavior, in keeping with its intent of last. year. 
(U.S. Senate Report, 1971) 


In response to this Congressional direc 


to negotiate a contract for the Support of this work. Discussions were 
held by phone, letter, and visits with Dr. Sweet concerning the proce- 
dures he would need to follow and the materials he would have to submit 
in order to obtain these funds. Arrangements were made for a site visit 
to the Boston City Hospital; the visiting team included two outside con- 
sultants who were experts in this area of r 


esearch, as well as a represen- 
tative from the NIH Office of Institutional Relations, whose principal 


tive, staff of the NIMH began 
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responsibility is the protection of the rights and welfare of human sub 
jects. The visit occupied a major portion of the day and included a nee f 
the facilities as well as intensive interviews with all of the relevant staff 

Prior to and during the NIMH review of the project, a series of revi и 
were conducted by a number of internal institutional committees at Har. 
vard Medical School, Massachusetts General Hospital, and Boston City 
Hospital. These reviews were necessary to fulfill a Public Health Service 
requirement that projects dealing with human subjects must have the 
approval of an institutional committee. 

In all candor, it must be admitted that there was considerable contro- 
versy during the review of this project: indeed one would be concerned 
if there were not controversy in the review of a project so fraught with 
ethical complexities. Ultimately, however, the project was approved by 
a committee of five senior scientists and clinicians from the Harvard 

Hospital, and other hospitals in 


Medical School, Massachusetts General 
the Boston area, and some of the controversial issues that arose during 


the NIMH review of the project were resolved by extensive further 


documentation and correspondence provided by the Neuro-Research 
Foundation. The award was made with the proviso that there be a thor- 
ough and continuing examination of the procedures and results of the 


project. It also is of interest that reviewers of the proposal felt that an 
important byproduct of the project was an opportunity to develop pro- 
cedures to provide the necessary broadening of the processes of review 
increasingly demanded by research and development in the behavioral 
and biological sciences. 

The procedures established for the protection of the human subjects 
in this project were designed to assure that the patients had problems 
severe enough to consider them as candidates for drastic treatment mea- 
sures, that the patients were fully informed (insofar as possible) as to the 
potential risks involved, and that every conceivable measure of protec- 
tion of the patients’ rights and well-being was exerted. For each and 


every patient admitted to the program, а review committee composed 
of scientists and physicians not associated with the Neuro-Research 
Foundation would review the individual case and approve or disapprove 
his admission to the program. If a patient were admitted to the program, 
it would be only for psychiatrie and medical evaluation studies, possibly 
including depth electrode studies of brain seizure activity. If on the basis 
of these studies the investigators felt that any brain lesioning procedures 
d, additional review by another outside committee would 
be required. me : 
During the tenure of this contract continuous monitoring of the project 
was conducted by NIMH staff and outside experts, including regular 
quarterly reports, continued contact by phone and mail, and periodic site 
visits. The emphasis on nonsurgical and psychiatric treatment provided 
encouraging indications that at least some patients exhibiting uncontrol- 
lable violent behavior can be helped without resorting to irreversible 
surgical treatment. 
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PSYCHOSURGERY, BEHAVIOR CONTROL, AND ETHICS 


With very few exceptions, medical scientists and clinicians oppose the 
“classical” lobotomy. As previously discussed, there is no sound evidence 
that it is a worthwhile procedure, and it too often has deleterious effects 
on personality. Actually, lobotomy is not the issue; as techniques for 
more refined brain recording, stimulation, and lesioning have become 
available, and as the literature on animal brain research has begun to 
define, however vaguely, functions of specific brain areas, practition- 
ers are more frequently considering surgical treatment for behavioral 
problems. 

As with any controversial issue, there are extreme opinions pro and 
con psychosurgery. Some opponents argue that psychosurgery in any 
form and for any behavioral or mental disturbance—regardless of its 
severity—should be outlawed. At the other extreme, some proponents 
argue that psychosurgical procedures should no longer be considered 
research but constitute routine and accepted treatment for a diversity 
of behavioral disorders ranging from childhood hyperkinesis to homo- 
sexuality. A middle stance would hold that psychosurgery might be of 
potential benefit and should be studied only for its use in cases of extreme 
mental illness, especially those that are life-threatening to the patient 
or those around him and that psychosurgery should be performed only 
after extensive attempts at alternate modes of therapy had failed. We 
believe that knowledge and understanding of the relation between the 
brain and behavioral abnormalities and of the optimal treatment of such 
abnormalities are still in very early stages. Even though there has been а 
vast amount of recent animal research, our knowledge of brain function 
is still insufficient for a clear understanding about what parts of the brain 
control what behaviors. In the literature on animal brain research, there 
are conflicting results and many controversial issues that will require 
much more basic research to resolve. It ean be argued that if we cannot 
really know how the brain works in animals, it is unjustifiable to try to 
manipulate human brain function. Also, even if we understood in con- 
siderable detail how the monkey, cat, or rat brain works, this under- 
standing could not be easily and uncritically generalized to the human 
brain. Although present evidence indicates that the basic principles of 
brain function are the same in animals and humans, evolutionary changes 
or other factors may be such that a given subdivision of the brain may 
subserve different functions in animals and humans. 

Another realm of criticism stems from the ethical and moral issues 
involved in changing or controlling the behavior of individuals. Psycho- 
surgery, because of its dramatic nature, has become a 
for concern and criticism about behavioral control. Alth 


clear and valid concerns about intrinsic i 
kind of behavior 


“lightning rod” 
ough there are 


Ssues of psychosurgery, this 
control has attracted and focused a diffuse “static” of 
concern that biomedical and behavioral sciences are eroding the freedom 


and dignity of man. Behavior control has always been exerted throughout 
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human history in the form of codification of laws, reward and punish 
ment in social and familial context, religious systems, and сазвала! 
systems. However, when the techniques of behavior control become 


explicit and when an advancing behavioral and medical technology makes 


it more possible to exert behavior control, the possibilities for abuse 


ume more likely. Although psychosurgery is а powerful means of 
nging behavior, the ethical problems posed by the modification of 
behavior with drugs or psychotherapy are probably of greater im зз 
because these means of control are available to a much wider ran; * f 
potential abusers than are the techniques of psychosurgery. d 
These developments, and other advances such as techniques for the 
prolongation of life, bring with them ethical complexities heretofore 
unheard of. Medical ethics has never been, as some mistakenly believe 
a matter of good versus evil or right versus wrong. It has always been a 
matter of when, where, and who draws the line as to whether a given 
medical procedure is helpful, harmful, or warranted and whether the 
potential benefits of any course of treatment outweigh the risks. 
| While, on the one hand, society is clamoring in certain fields for the 
right of every individual to treatment, an almost equally loud ery is be- 
ginning to be heard about the individual's right to death. Even more 
complicated ethical issues arise when the patient is part of a scientific 
study. While good clinical practice demands that the individual always 
be a patient first, scientific studies often demand the withholding of an 
experimental treatment from some patients while providing it to others 
in order to compare the effectiveness of the new treatment with older 


treatments or with no treatment. However, there are those who dispute 


the right to withhold treatment of potential value from anyone under 
any conditions. 


There are no simple answers to these problems. There is an increased 
public awareness of the extent and nature of these difficulties and some 
enthusiasm for the establishment of bioethics commissions which would 
serve as watchdogs and would devise suitable guidelines. Still others 
believe that it is best if the professions involved establish their own 
ethical criteria and monitor their own ac iviti middle stance would 
hold that while it is most appropriate for the variou 
tablish their own е in all instances t 


thical principles, 
with the involvemen 


t of other profe: 
sentation of laymen and consumers. 
In addition to 


these ethical concerns, 
poses a difficult problem. Informe г ] 
the patient who is about to undergo а medical or surgical treatment that 
exposes him to any degree of risk is fully 
risks involved and has a clear understanding of the procedures : 
This also applies to any person W is a subject in а medical or behavioral 
experiment in which there is any 
chological effects. 
Obtaining adeq 


ssions and with appropriate repre- 


uate informed consent is difficult even when the human 
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subjects or patients are of normal intelligence and are emotionally healthy 
and mature. It becomes infinitely more complicated when the subjects 
are seriously disturbed, below normal intelligence, or children and when 
the proposed procedures involve serious risks and irreversible conse- 
quences. But a clinical investigator or practitioner may not avoid the 


solemn responsibility of securing informed consent merely because it is 
difficult. 


DEPARTMENT OF HEALTH, EDUCATION, AND WELFARE POLICY 


Much of the biological, medical, and behavioral research conducted in 
this country is supported by funds from the Department of Health, Edu- 
cation, and Welfare. Recognizing the problems involved in research with 
human subjects, the department has established a Branch on Institu- 
tional Relations within the National Institutes of Health. This branch 
has the responsibility for formulating and overseeing policies intended 
to safeguard the rights and welfare of human subjects involved in activi- 
ties supported by grants or contracts from the Department of Health, 
Education, and Welfare. Current DHEW policy, outlined in a pamphlet 
entitled The Institutional Guide to DHEW Policy on Protection of Hu- 
man Subjects (1972), outlines three cardinal principles. They are, briefly: 
that the rights and welfare of the subjects involved are adequately pro- 


tected, that the risks to the individual are outweighed by the potential 


benefit to him or by the importance of the knowledge to be gained, and 
that informed consent is obtained in an adequate and appropriate manner. 
DHEW guidelines describe the basic elements of an informed consent 
as follows: 
1. A fair explanation of 
identification of those which are experimental; 
2. A description of the attendant discomforts and risks; 
3. A description of the benefits to be expected; 
4. A disclosure of appropri: 
advantageous for the subject; 
5. An offer to answer an 


the procedures to be followed, including an 


ate alternative procedures that would be 
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guardian? (2) under v hat circ S! а. consent 
Ё € W. ances сапа substitute о 

2 umst г agen 

instead of the patient? (8) тау раг ents ever consent to a risky procedure 


on behalf of a child when the benefit is primarily for science and the 


benefits for the child are minimal? 


The DHEW guidelines require that any institution е i 
5s mre на ре research involving human бааа 
| professional committee with responsibility for reviewing any applica- 

ion for support of such research to assure that the research protocol 
adequately fulfills the policy for protection of the subjects. The тоне 
of the DHEW guidelines and the required institutional committees diss 
produced a beneficial spill-over of ethical poliey to research and а. 
ment activities not supported by DHEW. 

The National Institutes of Health have established a study group to 
review the current DHEW policy in the light of the many issues that are 
becoming apparent in biomedical and behavioral research. The group 
will assess the effectiveness of existing policy in such areas as informed 
consent, institutional review committees, protection of subjects in pris- 
ons or mental institutions, and the NIH review of research applications 
involving human subjects. The rapid advancement of biomedical and 
behavioral sciences creates continuously changing ethical issues, and 
ethical policy cannot be static or fixed but must remain flexible so that 

s as they arise. 
ed the DHEW guide- 


Many hospitals and research institutions have us 
ructuring their own poliey for the protection of 


lines as a model for st 

human subjects. Some hospitals and institutions have been drawing up 

very detailed and comprehensive procedures for solving these questions. 

They require the presence of the subjects personal physician, personal 

lawyer, and immediate kin with periods of time being allocated for re- 

flection, questioning, and debate before consent is given. This is an area 
jk with our sister 


that will be receiving increased attention as we wor 
agencies to refine and expand the DHEW requirements for research on 


human subjects. 

With respect to the specific issue of psychosurgery, the NIMH and the 
National Institute of Neurological Diseases and Stroke have established 
an Inter-Institute Work Group on Brain and Behavior to examine а Vari 
ety of aspects of the proble i Work Group will be 


m. The Inter-Institute | 
responsible for the formulation of a comprehensive report containing 
recommendations and policies for further a 


S FOR THE FUTURE 


ctivities in this area. 


y current medical, biological, 


SAFEGUARD 
and moral problems posed Б 
able concern to government 


The ethical ‹ 
and behavioral techniques are of consider: cer me 
agencies as well as the health professions, the scientific communities, 
religious groups, the Congress, and the lay public. Knowledge, or at 
least the application of knowledge to wield political or social power, has 

mitant of scientific inquiry. 


always been а potentially dangerous conco 
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Even new knowledge which at first appears to have only beneficial con- 
sequences can create new problems. Potter in a book called Bioethics 
(1971) put it quite succinctly: *. . . once we have made the choice to open 
Pandora’s Box of Knowledge, we can never put back its contents, and 
mankind must continue to search forever for the wisdom that is needed 
to cope with the avalanche of new knowledge that is upon us.” We must 
somehow evolve a system of technology assessment whereby the state 
of knowledge can be assessed and its readiness for application can be 
addressed within the context of relevant social, human, and ethical is- 
sues. Such a system would assure the timely application of relevant tech- 
nology and help prevent the premature use of those techniques not 
thoroughly explored for their human and social implications. 

We believe that the answer to the dangerous use of knowledge is the 
creation of new knowledge, combined with a sensitive, rational, and 
humanitarian perspective on the applications of that knowledge. It is 
also important to broaden the training of scientists and practitioners 
to include such ethical issues and the humanities. 

With regard to the specific issue of psychosurgery, the first problem 
one encounters when attempting to deal with the ethical issues involved 
in this area is the lack of any reliable data on the extent of the practice, 
the type of patients subjected to psychosurgery, the particular types of 
surgery used, or the results on patient behavior. The estimates of the 
current number of psychosurgical procedures performed are probably 
inflated by the opponents of psychosurgery and minimized by the psycho- 
surgeons. The most systematic attempt (in this country) to assess the 
effects of psychosurgery on patient behavior and intellectual capacities 
was the Columbia-Greystone Project, which was conducted in 1947. Since 
then, many changes in psychiatric and medical practice have taken place: 
chemotherapy has dramatically changed the nature of the patient pop- 
ulation in mental hospitals, and new surgical procedures have been 
introduced which are quite different from those studied by the Columbia- 
Greystone Project. Without this kind of recent information, the current 
state of knowledge is insufficient to permit a clear-cut comparison of the 
potential benefits and possible risks of psychosurgery. We hope that the 
NIMH-NINDS Inter-Institute Work Group on Brain and Behavior will 
develop a mechanism for obtaining this kind of needed information. 

In recent years professional societies and private foundations have 
developed a variety of programs to deal with medical and behavioral 
ethics. One of the most active 


с t { private groups is the Hastings Center 
Institute of Society, Ethics and the Life Sciences. Founded in 1969 with 


support from several philanthropic organizations and individuals, the 
Institute has organized a variety of seminars on ethical issues, has estab- 
lished an experimental teaching program on medical ethics at Columbia 
University, and has a dedicated and professional staff conducting re- 
search programs on ethical issues in the areas of behavior control pop- 
ulation control, death and dying, and genetic engineering. The National 
Science Foundation has recently provided support for the Institute’s 
program of behavior control research. Organizations such as the Hastings 
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Center Insti ri i i 
Ld Шеше merit continued support, both from private and govern- 
The National Academy of Sciences—National Rese i 

established a Committee on the Life Sciences and эз) Кык ТЫЫ 
committee, composed of a prestigious group of lawyers. scientists a 
historians, is concerned with identifying the issues and problems emer, - 
їп from advances in the biological and medical sciences. The Committee 
has considered such issues as organ transplantation, behavior modifica- 
tion by drug administration to children, and fetal diagnosis and will issue 
a report of its deliberations in the near future. The prestige of the Na- 
tional шү of Sciences and the eminence of the Committee members 
assures that this group’s activities will be of i i i 
octet? E p e of considerable influence in 

| The American Psychological Association has just completed an exten- 
sive project to develop a set of ethical principles for the conduct of re- 
search with human participants. After four years of data-gathering from 
some 20,000 members of the association, a final report has been issued 
which goes into unprecedented detail on potential risks and recommen- 
dations for protection of human subjects. There has been a huge demand 
for reprints of this ethical code from other associations and private com- 
panies engaged in psychological research, and it promises to be very 
influential as a model for other behavioral sciences. 


CONCLUSION 
Continued research in medical science and the development of new 
medical technologies promise to present us with unprecedented ethical 
dilemmas that must be faced by the government, scientists, medical 
practitioners, and the public. The issues are and will be complex ones 
for which there are no easy answers, and deal with matters of law, phi- 
losophy, religion, science, consumerism, and public policy. The answers, 
when they do come, will not derive from either emotional attacks or a 
blind faith in the sanctity of science and medical practice but from a rea- 
soned analysis of all facets of the issues by concerned, sensitive, and ra- 
tional representatives of the diverse groups involved. The answers to the 
issues of psychosurgery will depend heavily upon scientific advances In 
our studies of the brain and behavior. Some day it may be possible to say 
tential benefits of 


that in the light of our expanded knowledge the po е 
human subjects— 


psychosurgery far outweigh any possible risk to the 
but that day is not yet. We are sobered by the thought of how very much 


more there is to know about that marvelous structure, the human brain. 
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BRAIN SURGERY IN AGGRESSIVE EPILEPTICS 


Vemon Н. Mark, Boston City College 
and Harvard Medical School 


A little over two years ago Frank Ervin, a psychiatrist, and I, a neuro- 
surgeon, wrote a book called Violence and the Brain, detailing the appli- 
cation of brain surgery to problems of violent behavior. The public 
response to that book underscores with great vividness the fact that 
the medical issues of neurosurgery are no more interesting or vital than 
the issues of neurosurgery’s social role. I would like here to offer some 
reflections about those social issues. 

The most important problem to conjure with, from the standpoint 
of the sciences of human behavior, is the unfortunate dichotomy in basic 
approaches to behavior. Certain kinds of behavior (for instance, paral- 
ysis, blindness, and dementia) were put into the province of organic 
neurology. Physicians working in this field have been increasingly reluc- 
tant to examine other kinds of abnormal behavior, even when they are 
associated with obvious brain abnormalities. 

On the other hand, social psychiatrists, sociologists, criminologists, 
and many psychologists tend to view the other behavioral abnormalities, 
for instance intractable depressions and aggressive assaultive behavior, 
as pure reflections of unusual or abnormal environmental stress. They 
believe that brain function or dysfunction is not an important determi- 


nant as far as abnormal behavior is concerned. To them 
function with the same “normal” 


ment for causes but also to the brain itself, 

Yet when President Lyndon Johnson and Milton Eisenhower convened 
the Commission on Violence, there were over fifty consultants repre- 
senting various fields of sociology, criminology, history, government, 
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law, social psychiatry, and social psychology, but only two representing 
the brain! 

Let me stress that recognition of the brain in behavior does not entail, 
as some critics might fear, that all behavior should be controlled through 
the brain. Rather it is precisely because any behavior, normal or ab- 
normal, could be modified through altering the brain, at least in principle, 
that it is absolutely crucial to make moral decisions about the sort appro- 
priate for neurosurgery and other direct brain manipulations. There are 
important moral issues at stake, for example, in the very definition of a 
problem as a medical one, for which medical therapy is appropriate. This 
problem is pressed further when a behavior problem is defined as a neu- 
rological one. 

My own position regarding the appropriateness of neurosurgery can 
be illustrated in terms of three alternatives. The first two of these I 
unequivocally reject: 

The first of these positions holds that medical means should be under- 
taken to improve any behavior, wherever possible, be the behavior 
normal or abnormal. Although neurosurgery is not in a position now to 
improve normal behavior, some people advocate the use of drugs—am- 
mines, for instance—for that purpose. The advantage of this posi- 
tion is that it avoids the obvious difficulties in defining normality. The 
grave disadvantage is that it makes the medical men authorities on im- 
provement and the good life, which they are not, at least under our pres- 
ent social arrangement. Moral values are social concerns, not medical 
ones in any presently recognized sense. | 

A second position holds that any abnormal behavior should be treated 
by whatever medical means are available. Often argued by psycho- 
therapists, this position says that, regardless of any organic abnormality, 
what is undesirable in abnormal behavior is the behavior, not its organic 
base. The criteria for alteration should refer to the behavior, irrespective 
of whether the organic base is normal or abnormal. Should an unusual 


brain abnormality produce a 50 point I.W. rise, no one would suggest 


ablating it: its behavioral product is desirable. (With regard to neuro- 
inciple that it 


surgical control of violent behavior, I am against the princi | 
could be used to treat abnormal behavior when there is no organic ab- 


normality of the brain.) 
A third position, then, 
should be used only when b 
an abnormality in the brain. 
associated with brain disease s 
not medically. А 
With this brief positioning of what I take to be support for a very oe 
cumscribed domain for neurosurgical procedures, I now want to dea 
with some common criticisms of neurosurgery. Most of the criticisms 
ore than it is or assume 


either construe neurosurgery to be capable of m i : 
that it would be used in areas I believe to be inappropriate. Like any 


technological power, neurosurgery can be misused, and setting the limits 
of its use is more than a medical problem. 


pheta 


is that medical procedures like neurosurgery 
ehavior is abnormal and bad, as the result of 
‘As I shall repeat below, violent behavior not 
hould be dealt with politically and socially, 


262 PSYCHOSURGERY: A MULTILEVEL CONTROVERSY 


POLITICAL VERSUS PERSONAL VIOLENCE 


Implicit in many of the criticisms of the surgical treatment of violent 
epileptics is the fear that this treatment will be used against political 
protestors. This fear is based on a semantic confusion about the word 
“violence.” Many activities in our society are called violent. The “estab- 
lishment” tends to view as violent any protest movement against the 
war, unfair living conditions, or the prison system. Protestors view the 
reaction of police and the National Guard as violent. Minority groups 
have indicated that social or job discrimination is a form of violence. 

The kind of violent behavior for which I might approve neurosurgery, 
however, has a much narrower definition: personal violent behavior; un- 
warranted and usually unprovoked acts that directly attempt to or actu- 
ally do injure or destroy another person or thing. And as I have said, 
I would not approve neurosurgery even in such cases unless the personal 
violence could be traced to organic brain disease and could not be treated 
by non-surgical methods. 

Some political psychiatrists have argued that social injustice provokes 
individual acts of violence which seem “senseless” but which in reality 
are political protests against the social system. As Goebbels showed, 
political violence of mass proportions can be culled up without counting 
on people with diseased brains. However, the person whose violence is 
related to brain disease and is used by a Hitler or Goebbels is a danger 
to himself and his loved ones and cannot be counted on to rage only in 
politically strategic situations. Thus the morality of preventing such an 


individual from seeking proper medical diagnosis and care is certainly 
in question. 


BIOLOGICAL FACTORS IN PERSONAL VIOLENCE 


Many sociologists believe that brain dysfunction does not have a role 
in such behavioral abnormalities as intractable depression, thought dis- 
orders, paranoia, or aggressive assaultive behavior of the sort I am con- 
cerned with. In the last case, however, there is solid medical evidence 
to link aggressive behavior to focal brain disease. This behavioral symp- 
tom is often present in such clinical disorders as temporal lobe epilepsy, 
temporal lobe tumors, infections of the brain (such as rabies and post- 
encephalitic syndromes), and serious brain injuries which affect the 
under-surfaces of the frontal lobes and the anterior tips of the temporal 
lobes (usually a transient phenomenon in the last disorder). 

f The most frequent occurrence of brain dysfunction in violent behavior 
is in brain poisoning, the most serious and ubiquitous of which is caused 
by alcohol. Alcohol is a Specifie brain poison, and individuals acutely 


poisoned with alcohol may have as much dysfunction during the poisoning 
as would be caused by a brain tumor, infection, or injury. The only dif- 
ference between the former and the latter is that alcohol poisoning is 
temporary. 
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" 2 m n ИЕ accidents caused by drunk drivers takes 
eater toll than the violence in political prot i 
drunk driver suffers from a tem b disi с 

к porary brain abnormality. Alth 
certainly do not advocate neurosurgery for the (елш of козга. e 
nor for all kinds of organic brain disease, it is clear that violence is some- 

times a function of brain abnormalities and should be viewed as such 


RACISM 


In the most avant-garde circles today the favorite critical epi i 
Шиш. Neurosurgery seems to draw its critics from ^ rd dn 
pi deer neurosurgical treatment of violent epilepties has been 

Speaking to a national group of psychiatrie investi 
approach to the problem of violent bees a елле hee 
chiatry conceded that the biological-social model of violence and the 
investigation of brain disease associated with violence had some merit 
But he classified our project with the proposal of those psychologists 
who claim that black people have a constitutionally inferior IQ compared 
to white people. Admitting that the racial idea might have some merit 
too, he felt that nevertheless funding and research should not be given 
to such projects until all possible avenues of social and educational re- 
habilitation have been exhausted. 

It seems to me there are really two issues here. The first concerns the 
political implications of a theory about violence. One of the functions of 
a theory is that it tells you where to look to find the phenomenon in ques- 
tion. Does the theory that some violence is caused by brain disease lead 
us to expect it is a characteristic mainly of black people? Certainly not. 


On the other hand, the theory that personal violence is caused exclusively 
by social conditions might very well lead us to look at the black ghettos. 
The environmental cues to personal violence may very well cluster around 


racially differentiated areas. 
The second issue 18 whether there is any empirical evidence that or- 
ganically based personal violence is more common among blacks than 
whites. In my experience there is no special correlation of violence with 
race. Domestic personal violence occurs in upper- and middle-class homes 
as well as in the ghettos. When it occurs in the ghetto, however, it may 
be more visible, spilling out into the streets from the very pressures of 
overcrowding. It may very well be that violence in ghettos more often 
gets reported in police records and other data available to the sociologist. 
But the perspective of the physician is the emergency room and clinic 
where the products of violence are immediate. From this perspective, 
claret is the predominant color, not black or white. у | 

It is my perception that the biological-social model of violence will not 
concentrate on one race, ethnic group, or other social segment of our 
population as do the subculture theories of violence. In fact, I feel it is 
important in cases of violence related to brain disease to study the violent 
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act primarily and to consider the style of violence, which is socially deter- 
mined, as a secondary phenomenon. 


DANGERS IN A MEDICAL MODEL OF VIOLENCE 


All sorts of dire predictions have been made about the outcome of brain 
research directed toward the understanding and treatment of emotional 
disorders and especially those related to violent behavior. Shades of 
1984! “The neurosurgeons want to put electrodes in everyone’s brain to 
keep them from protesting!” “They’re going to develop a new drug that 
will completely destroy the will!” “Man’s dignity will be shattered, and 
his innate human quality will be destroyed!” These would indeed be 


serious statements if they were true. But it is important to see just what 
the issues are. 

It is one thing to advocate neurosurgical procedures for certain kinds 
of violent behavior caused by organic b; 


quite another to advocate them as general methods of behavior control. 


immoral ends by unscrupulous people. No gr 
that provided by Nazi Germany. 
But interestingly enou 


mental ones, not direct alterations of the brain. 


systems, 
ight have reduplicated Skinner’s experi- 


ced brain electrode technology of the 
sophisticated group of psychic drugs 
that | more perfect form of behavior control than that 
initiated by environmental “natural” factors in the Third Reich? 


One of the important factors to emerge from our own research on the 
relative influence of deep el 


E ce of de ectrieal brain Stimulation versus environ- 
mental "natural" stimuli is th 


subject a demanding and attention-absorbing task during electrical brain 
stimulation. 


My thesis that behavior control through the direct manipulation of 
the brain is not as dangerous for mass abuse as environmental forms of 
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Г ни е иа E Pod that it has no dangers whatsoever 
Я е soci: itation ү 

or administers drugs on whom and Spl captas ре nci 

through publie discussion and social decisions. My own бака 18 th t the 

dangers of mass abuse are not sufficient to warrant preventin the < 

development of е techniques that might have very great ther bi 

value for patients with organic brain disease. = 


MORAL PROBLEMS IN NEUROSURGERY 


If neurosurgery is not much of a danger as a political tool, there still 
"Y еи problems with its medical use. I would like to address two 

First is the problem of neurological diagnosis. The em i 
been placing on the neurological side of he конен of AR ve 
be taken out of the context of environmental factors also. Due to the ve 
fundamental problem of specialization in medicine, it is very difficult r4 
maintain a biological-social model of violence in actual practice. The 
neurologists are in one place, and the psychiatrists and social workers 
are in another. 

Yet if the model is worthwhile, treatment of a patient should involve 
not only his brain but his family, living conditions, job, and role in society. 
It is very important, therefore, to embed a neurological diagnosis of prob- 
lems of violence into a larger integrated approach to human behavior. 
The social forms for this holistic therapy are yet to be worked out, al- 
though my own group has tried to include psychiatrie and social psychi- 


atric diagnosis along with the neurological. 
From my reading of the present situat 
greater danger is that the neurological sid 
for example, an airplane hijacker killed one 
d with a blow to the head. Examination of this 


he himself was subdue 

man afterward revealed he had been a neurological cripple for eleven 

years, ever since receiving a gunshot wound of the brain. Yet repeated 

tests, including hours of brainwave examinations, psychological testing, 

psychiatrie and neurological assessment, failed to reveal the tremendous 

damage that had occurred in this man's emotional center or limbic brain. 
had their classic 


ion, however, I believe the 
e will be left out. Recently, 
pilot and shot another before 


The difficulties of diagnosis put tragic expression 
in the case of George Gershwin. This talented composer was seen and 
treated psychiatrically for a long time while a tumor in the limbic system 
of his brain grew to an unmanageable and untreatable size. The George 
Gershwin syndrome of the thirties is still being treated in the seventies. 
Recently eighteen patients were committed to a mental hospital at one 
of our best university centers who turned out to have tumors of their 
limbic brains. In some cases the true nature of this illness was not recog- 
nized until the tumor had caused the patient’s death. 

Second, after the diagnosis of brain disease has been confirmed, what 
are the problems of proper consent to treatment? Does the patient con- 
sent to the treatment? Does he know the dangers of the treatment and 
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how those dangers compare with letting him pass without treatment? 
What are the wider factors involved in informed consent when the patient 
is mentally incompetent?! 

Usually close members of their families are the sources of consent. But 
in cases of surgery for violence, I believe the patient and his family should 
have the assistance of an impartial, noninvolved professional group to 
determine whether surgery or other forms of treatment should be under- 
taken. In practical terms this means that a committee of some sort, 
composed of physicians, or in some cases, physicians and informed lay- 
men should be present to monitor the medical, psychiatric, and/or surgi- 
са] treatment.? 

In my own practice, I and my group do not accept patients for treat- 
ments who do not want therapy, and we do not believe that the public 
good or public interest should intrude upon the personal medical model 
in terms of protecting the public against violent individuals. 


FREE WILL AND BEHAVIOR CONTROL 


Radical critics of the biological- 
human violence to be an express 
consider the medical correction of 
result, stop violent episodes to 
on human dignity. As a physici 
priate, not because I deny fre 
human life. 


Many of the patients who come to us with focal brain disease asso- 
ciated with violent behavior are so offended by their own actions they 


have attempted suicide, They feel their human dignity has been lost 
precisely because of uncontrollable behavior patterns they find unnatural 
and repugnant. 


social model of violence often construe 
ion of free will. In line with this they 
brain disease that would, as asecondary 
have an unnatural and degrading effect 
an I find this view particularly inappro- 
е will but because I prize the quality of 


own behavior. It enhane 
and does not detract fr 


Finally, it is appropriate to ret 


es ior control of one individual by 
another more difficult or even impossible. 


In other words, one of the expected benefits of increased brain re- 
search will be the creation of new techniques enabling each individual 
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better to control or govern his or her behavior, without unwarranted or 
unwanted interference by other individuals or devices. 

There is already enough knowledge of environmental and psycho- 
pharmacological drugs to control a vast segment of our population with- 
out invoking brain surgery or electrical brain stimulation. The great 
hope of emotional brain research is that it will free us from our present 
tyrannies and future dangers of control. To this end, brain scientists 
need the help of philosophers, ethicists, theologians, social scientists, 
and jurists, working in concert. 


NOTES 


1, Prison inmates suffering from epilepsy should receive only medical treatment; surgical 
therapy should not be carried out, because of the difficulty in obtaining truly informed 
consent. 

2. In conjunction with Dr. David Allen, we are exploring “consumer advocacy” utilizing 
a group with religious, legal, and community representatives in addition to physicians, 
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